GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT 
val leu his his his gin asn pro glu phe 

62 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC 
gin leu his glu lys his his leu leu leu 

122 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC 
ser lys gly ser thr lys lys arg asp val 

182 

CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA 
pro leu leu lys asp gly arg val val thr 

242 

CTT CCT TCG GGC TAT CTT GGC TAC CAA GAG 
leu pro ser gly tyr leu gly tyr gin glu 

302 

ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG 
ile lys trp val asp gly gly lys pro leu 

362 

GTG TAT ACT CAG GAT CAG CAT TTA CAT AAT 
val tyr thr gin asp gin his leu his asn 

422 

GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA 
gly ala gin ala leu gly asn glu leu val 

482 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT 
gly his val met ile ala phe leu pro thr 

542 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC 
arg ala thr gin glu glu val ala val asn 

602 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC 
gin cys his glu glu gly leu glu ser his 

662 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG 
ala glu pro tyr val ala ser glu tyr lys 



TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
tyr asp glu ile lys ile glu leu pro thr 

92 

ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
thr phe phe his val ser cys asp asn ser 

152 

GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
val glu thr gin val gly tyr ser trp leu 

212 

AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
ser glu gin his ile pro val ser ala asn 

272 

CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu gly met gly arg his tyr gly pro glu 

332 

CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
leu lys ile ser thr his leu val ser thr 

392 

TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
phe phe gin tyr cys gin lys thr glu ser 

452 

AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
lys tyr leu lys ser leu his ala met glu 

512 

ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
ile leu asn gin leu phe arg val leu thr 

572 

GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
val thr arg val ile ile his val val ala 

632 

TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
leu arg ser tyr val lys tyr ala tyr lys 

692 

ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
thr val his glu glu leu thr lys ser met 



111 

ACC ACG ATT CTC AAG CCT TCT GCC GAT TTC 
thr thr ile leu lys pro ser ala asp phe 

782 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT 
trp phe phe phe asp val leu ile lys ser 

842 | Cadherin Cleavage | 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT 
val lys leu leu arg asn gin arg phe pro 

902 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG 
val asn met leu met pro his ile thr gin 

962 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC 
asn ala asn his ser leu ala val phe ile 

1022 

TTT GTC TTC AAG CAG ATC AAC AAC TAC ATT 
phe val phe lys gin ile asn asn tyr ile 

1082 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT 
leu phe glu tyr lys phe glu phe leu arg 

1142 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC 
leu asn leu pro met pro phe gly lys gly 

1202 

GAC TAC TCA TTA ACA GAT GAG TTC TGC AGA 
asp tyr ser leu thr asp glu phe cys . arg 

xxx| 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG 
glu val gly thr ala leu gin glu phe arg 

1322 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT 
lys asn leu leu ile lys his ser phe asp 

1382 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT 
arg ile ala thr leu tyr leu pro leu phe 

1442 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG 
asn val arg asp val ser pro phe pro val 



752 

CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
leu thr ser asn lys leu leu arg tyr ser 

812 

ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 
met ala gin his leu ile glu asn ser lys 

872 

GCA TCC TAT CAT CAT GGA GCG GAA ACC GTT 
ala ser tyr his his ala ala glu thr val 

932 

AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
lys phe gly asp asn pro glu ala ser lys 

992 

AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
lys arg cys phe thr phe met asp arg gly 

1052 , 

AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 
ser cys phe ala pro gly asp pro lys thr 

1112 

GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
val val cys asn his glu his tyr ile pro 

1172 

AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
arg ile gin arg tyr gin asp leu gin leu 

1232 | Cadherin EC 

AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asn his phe leu val gly leu leu leu arg 

1292 

GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val arg leu ile ala ile ser val leu 

1352 

GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 
asp arg tyr ala ser arg ser his gin ala 

1412/471 

GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
gly leu leu ile glu asn val gin arg ile 

1472 

AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn ala gly met thr val lys asp glu ser 



V 



1502 

CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG 
leu ala leu pro ala val asn pro leu val 

1562 

AGC CTG CAC AAG GAC CTG CTG GGC GCC ATC 
ser leu his lys asp leu leu gly ala ile 

1622 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT 
thr pro asn ile asn ser val arg asn ala 

1682 

TCG GGT AAC AGC CTT CCA GAA AGG AAT AGT 
ser gly asn ser leu pro glu arg asn ser 

1742 

CAA AGT AGC ACA TTG GGA AAT TCC GTG GTT 
gin ser ser thr leu gly asn ser val val 

1802 

AAG AGC CTA CTG ATG TGT TTC CTC TAC ATC 
lys ser leu leu met cys phe leu tyr ile 

1862 

ACA TAT TGG AAC AAG GCT TCA ACA TCT GAA 
thr tyr trp asn lys ala ser thr ser glu 

1922 

TGC CTG CAC CAG TTC CAG TAC ATG GGG AAG 
cys leu his gin phe gin tyr met gly lys 

1982 

GGA CCC ATA GTT CAT GAT CGA AAG TCT CAG 
gly pro ile val his asp arg lys ser gin 

2042 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC 
met met his ala arg leu gin gin leu gly 

2102 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG 
ser tyr gly his ser asp ala asp val leu 

2162 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG 
thr glu val cys leu thr ala leu asp thr 

2222 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT 
gin leu leu ala asp his gly his asn pro 

2282 

TGT TTT CTT CAA AAA CAT CAG TCT GAA ACG 



1532 

ACG CCG CAG AAG GGA AGC ACC CTG GAC AAC 
thr pro gin lys gly ser thr leu asp asn 

1592 

TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
ser gly ile ala ser pro tyr thr thr ser 

1652 

GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 
asp ser arg gly ser leu ile ser thr asp 

1712 

GAG AAG AGC AAT TCC CTG GAT AAG CAC CAA 
glu lys ser asn ser leu asp lys his gin 

1772 

CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 
arg cys asp lys leu asp gin ser glu ile 

1832 

TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
leu lys ser met ser asp asp ala leu phe 

1892 

CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
leu met asp phe phe thr ile ser glu val 

1952 

CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 
arg tyr ile ala arg asn gin glu gly leu 

2012 

ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 
thr leu pro val ser arg asn arg thr gly 

2072 

AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
ser leu asp asn ser leu thr phe asn his 

2132 

CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
his gin ser leu leu glu ala asn ile ala 

2192 

CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
leu ser leu phe thr leu ala phe lys asn 

2252 

CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 
leu met lys lys val phe asp val tyr leu 

2312 

GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 



cys phe leu gin lys his gin ser glu thr 
2342 

TCC TTA ATT TAT AAG TTT CCC TCA ACA TTC 
ser leu ile tyr lys phe pro ser thr phe 

2402 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC 
leu cys tyr glu ile leu lys cys cys asn 

2462 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC 
ser gin leu leu tyr phe leu met arg asn 

2522 

GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT 
val arg thr his leu gin val ile ile ser 

2582 

ATT GGG GAA ACC AGA TTC CAG CAG TCC CTG 
ile gly glu thr arg phe gin gin ser leu 

2642 

CGG CTT ATT AAG CAC ACC AGC TTC TCC TCT 
arg leu ile lys his thr ser phe ser ser 

2702 

ACG GTG CTA ATG GCC ACC GCC CAG ATG AAG 
thr val leu met ala thr ala gin met lys 

2762 

GAC CTC CAG TAC AGC CTG GCC AAA TCC TAT 
asp leu gin tyr ser leu ala lys ser tyr 



ala leu lys asn val phe thr ala leu arg 
2372 

TAT GAA GGG AGA GCG GAC ATG TGT GCG GCT 
tyr glu gly arg ala asp met cys ala ala 

2432 

TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
ser lys leu ser ser ile arg thr glu ala 

2492 

AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
asn phe asp tyr thr gly lys lys ser phe 

2552 

GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val ser gin leu ile ala asp val val gly 

2612 

TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
ser ile ile asn asn cys ala asn ser asp 

2672 

GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
asp val lys asp leu thr lys arg ile arg 

2732 

GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
glu his glu asn asp pro glu met leu val 

2792 

GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
ala ser thr pro glu leu arg lys thr trp 



2822 2852 | xxxxxxxxxxxxxxx Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

Transmembrane Domain xxxxxxxxxxxxxxxxxxxxxxxxxx | 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA GGC GTG TTT AGA CAA 
tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

2942 2972 

GGA TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 

3002 3032 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 
asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

3062 3 °92 

TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 



cys ala asp gly leu trp lys ala glu arg 
3122 

ATC ATC CCC ATT TAT GAG AAG CGG AGG GAT 

ile ile pro ile tyr glu lys arg arg asp 

3182 

ATT TAC AAG GAA CCC AAA CTC ACA CCG CTG 
ile tyr lys glu pro lys leu thr pro leu 

3242 

TAC TCG GAT AAA TTT GGT TCT GAA AAT GTC 
tyr ser asp lys phe gly ser glu asn val 

3302 

CCT AAG GAT CTG GAT TCT AAG TAT GCA TAC 
pro lys asp leu asp ser lys tyr ala tyr 

3362 

GAC GAA AAA GAG TTG CAA GAA AGG AAA ACA 
asp glu lys glu leu gin glu arg lys thr 

3422 

TTC ATG TTT GAG ATG CCA TTT ACG GAG ACC 
phe met phe glu met pro phe thr gin thr 

3482 

TGC AAA CGG CGC ACC ATC CTG ACA GCC ATA 
cys lys arg arg thr ile leu thr ala ile 



tyr glu leu ile ala asp ile tyr lys leu 
3155 

TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
phe phe glu asp glu asp gly lys glu tyr 

3212 

TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 
ser glu ile ser gin arg leu leu lys leu 

3272 

AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 
lys met ile gin asp ser gly lys val asn 

3332 

ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
ile gin val thr his val ile pro phe phe 

3392 

GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
glu phe glu arg ser his asn ile arg arg 

3452 

GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
gly lys arg gin gly gly val glu glu gin 

3512 

CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
his cys phe pro tyr val lys lys arg ile 



3542 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG 
pro val met tyr gin his his thr asp leu 

3 602 xxx Coiled-Coil 1 xxxxxxxxxxxxx 
AGT AAG AAG GTG GCG GAG CTC CGG CAG CTG 
ser lys lys val ala glu leu arg gin leu 

3662 xxxxxxxxxxxxxxxxxxxxxx J 

CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT 

leu gin leu lys leu gin gly ser val ser 

3722 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA 
ala arg ala phe leu asp asp thr asn thr 

3782 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA 
leu lys glu val phe arg gin phe val glu 

3842 xxx Coiled-Coil 2 xxxxxxxxxxxxx 
CGT CTG ATT AAA GAA GAC CAG CTC GAG TAT 



3572 Ixxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
asn pro ile glu val ala ile asp glu met 

3632 xxxx Coiled-Coil 1 xxxxxxxxxxxxx 
TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
cys ser ser ala glu val asp met ile lys 

3692 

GTT CAG GTC AAT GCT GGC CCA CTA GCA TAT 
val gin val asn ala gly pro leu ala tyr 

3752 

AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
lys arg tyr pro asp asn lys val lys leu 

3812 | xxxxxxxxxxxxxxxxxx 

GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
ala cys gly gin ala leu ala val asn glu 

3872 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 



arg leu ile lys glu asp gin leu glu tyr gin glu glu met lys ala asn tyr arg gl u 



3902 xxx Coiled-Coil 2 xxxxxxxxxxxxx 3932 xxxxx | 

ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG ATC TGC CCC CTG GAG GAG AAG ACG 
met ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

3962 3992 

AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

4022 | xxxx PBM xxxx| 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

4082 4112 

TGG GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 
4142 4172 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 
4202 4232 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 
4262 4292 

GCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 
4322 4352 

CTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 
4382 4412 

GTT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG" GGG GAC CTT TTG 
4442 4472 

CCT CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 
4502 4532 

GGA GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 
4562 4592 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 
4622 4652 

GTC ATT AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT ATT ACA TTT CTA CAT TTT TAA TAC 
4682 4712 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 

I 

4742 4772 

ATA CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 

4802 
TTT ACT 




rie,.zA 



GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 

val leu his his his gin asn pro glu phe tyr asp glu ile lys ile glu leu pro thr 

CAG CTG CAT G A A A AG CAC CAC CTG TTG CTC ACA TTC TTC CAT GTC AGC TGT G AC AAC TCA 

gin leu his glu lys his his leu leu leu thr phe phe his val ser cys asp asn ser 

1SE 152 

AGT AAA GG A AGC ACG A AG A AG AGG GAT GTC GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 

ser lys gly ser thr lys lys arg asp val val glu thr gin val gly tyr ser trp leu 



CCC CTC CTG AAA G AC GGA AGG GTG GTG ACA AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 

pro leu leu lys asp gly arg val val thr ser glu gin his ile pro val ser ala asn 

EMS 272 

CTT CCT TCG GGC TAT CTT GGC TAC CAA GAG CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 

leu pro ser gly tyr leu gly tyr gin glu leu gly met gly arg his tyr gly pro glu 



3DE 



33E 



ATT AAA TGG GT A GAT GGA GGC A AG CCA CTG CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
ile lys trp val asp gly gly lys pro leu leu lys ile ser thr his leu val ser thr 



GTG TAT ACT CAG GAT CAG CAT TTA CAT AAT TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
val tyr thr gin asp gin his leu his asn phe phe gin tyr cys gin lys thr glu ser 



GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
gly ala gin ala leu gly asn glu leu val lys tyr leu lys ser leu his ala met glu 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
gly his val met ile ala phe leu pro thr ile leu asn gin leu phe arg val leu thr 

SMS 572 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
arg ala thr gin glu glu val ala val asn val thr arg val ile ile his val val ala 

bDE b32 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC TTG AGG TCA TAT GTT AAG TAC 6CG TAT AAG 
gin cys his glu glu gly leu glu ser his leu arg ser tyr val lys tyr ala tyr lys 

bbE ^ 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
ala glu pro tyr val ala ser glu tyr lys thr val his glu glu leu thr lys ser met 



FI6t.Z€) 



T 



722 7SE 

ACC ACG ATT CTC A AG CCT TCT GCC GAT TTC CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
thr thr ile leu lys pro ser ala asp phe leu thr ser asn lys leu leu arg tyr ser 

7fiE fllE 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 
trp phe phe phe asp val leu ile lys ser met ala gin his leu ile glu asn ser lys 

fllE I Cadherin Cleavage I 67E 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 
val lys leu leu arg asn gin arg phe pro ala ser tyr his his ala ala glu thr val 

IDE 132 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT GGA GAT A AT CCA GAG GCA TCT AAG 
val asn met leu met pro his ile thr gin lys phe gly asp asn pro glu ala ser lys 

1b2 112 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
asn ala asn his ser leu ala val phe ile lys arg cys phe thr phe met asp arg gly 

1022 1DS2 

TTT GTC TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 
phe val phe lys gin ile asn asn tyr ile ser cys phe ala pro gly asp pro lys thr 

lOSfi 1112 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
leu phe glu tyr lys phe glu phe leu arg val val cys asn his glu his tyr ile pro 

11M2 1172 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
leu asn leu pro met pro phe gly lys gly arg ile gin arg tyr gin asp leu gin leu 

1E0E 1235 I Cadherin EC 

GAC TAC TCA TTA ACA GAT GAG TTC TGC AGA AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asp tyr ser leu thr asp glu phe cys arg asn his phe leu val gly leu leu leu arg 

xxx I 1212 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val gly thr ala leu gin glu phe arg glu val arg leu ile ala ile ser val leu 

1322 13S2 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 
lys asn leu leu ile lys his ser phe asp asp arg tyr ala ser arg ser his gin ala 

1352 1M12 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
arg ile ala thr leu tyr leu pro leu phe gly leu leu ile glu asn val gfcn arg ile 

mM2 1M72 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn val arg asp val ser pro phe pro val asn ala gly met thr val lys asp glu ser 



lSQE 1532 

CTG GCT CTA CCA GCT GTG A AT CCG CTG GTG ACG CCG C AG A AG GGA AGC ACC CTG G AC A AC 
leu ala leu pro ala val asn pro leu val thr pro gin lys gly ser thr leu asp asn 

15bE 15^2 

AGC CTG CAC A AG G AC CTG CTG GGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
ser leu his lys asp leu leu gly ala ile ser gly ile ala ser pro tyr thr thr ser 

ACT CCA A AC ATC AAC AGT GTG AGA A AT GCT GAT TCG AG A GGA TCT CTC ATA AGC ACA GAT 
thr pro asn ile asn ser val arg asn ala asp ser arg gly ser leu ile ser thr asp 

TCG GGT AAC AGC CTT CCA GAA AGG A AT AGT GAG AAG AGC A AT TCC CTG GAT AAG CAC CAA 
ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys his gin 

17M2 177H 

CAA AGT AGC ACA TTG GGA A AT TCC GTG GTT CGC TGT GAT AAA CTT GAC C AG TCT GAG ATT 
gin ser ser thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu ile 

16Q2 lfi3S 

AAG AGC CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
lys ser leu leu met cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 

IflbS 1612 

ACA TAT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
thr tyr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 

1^22 nsE I xxxxxxxxxxxxxxxxxxxx 

TGC CTG CAC CAG TTC CAG TAC ATG -GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 
cys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 

1162 xxxxxxxxxx deleted in CLASP-2D (KIAA105 8) xxxxxxxxxxxxxxxxxxxxxx I 

GGA CCC ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 

gly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

2012 2D72 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
met met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn his 

21D2 2132 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
ser tyr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn ile ala 

Deleted 

21bS lxxx 
ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
thr glu val cys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 

in HC2B 

xxx I 22S2 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 

gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 



1 



EEflE 2312 

TGT TTT CTT CAA AAA CAT C AG TCT G A A ACG GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 
cys phe leu gin lys his gin ser glu thr ala leu lys asn val phe thr ala leu arg 

2342 S37E 

TCC TTA ATT TAT A AG TTT CCC TCA ACA TTC TAT GAA GGG AG A GCG GAC ATG TGT GCG GCT 
ser leu ile tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

EMQE 2432 

CTG TGT TAC GAG ATT CTC A AG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

2 Mb 2 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

2S22 "52 

GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val arg thr his leu gin val ile ile ser val ser gin leu ile ala asp val val gly 

ESSE 2bl2 

ATT GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC ACT GAC 
ile gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

2bM2 2^2 

CGG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
arg leu ile lys his thr ser phe ser ser asp val lys asp leu thr lys arg ile arg 



E70E 



E73E 

ACG GTG CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
thr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 



27b2 

GAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
asp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2a22 E6S2 ixxxxxxxxxxxxxxx Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

[Additional and differential exon usage found at position 2*127 consisting 
of bl nucleotides. This entire sequence is found in Human CLASP— 2D 
(KIAAlDSfi) and not other isoforms of CLASP-E- It has a sequence of: 
AAGCAGTCCAGTGGGAGCCGCCCCTTCTCCCCCACAGCCATAGCGCCTGCCTGAGGAGGAGCCGGGGAG3 

Transmembrane Domain xxxxxxxxxxxxxxxxxxxxxxxxxx I K : 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA SGC GTG TTT AGA CAA 

tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

E1ME 2^72 

GGA TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 



r 



3002 | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Sequence deleted in CLASP-2E xxxxx 

GAC GTG GGG ATG C AG GAT GTC CAT TTC A AC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 
asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

3QbB xxxxxxxxxxxxxxxxxx I 3012 

TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC G AC ATC TAC AAA CTT 
cys ala asp gly leu trp lys ala glu arg tyr glu leu ile ala asp ile tyr lys leu 



[Additional and differential exon usage found at position 31S3- The 
entire sequence below is found in Human CLASP-2D- Underlined sequence is 
found in Human CLASP-2B , 2C and 2E - 

TGAGAGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCAT 
GCACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGGCAGCGCAATACCAGTTT 
ACAGACAGTGAAACAGATGTGGAGGGATT3 . 

3122 T 31SS 

ATC ATC CCC ATT TAT GAG A AG CGG AGG GAT TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
ile ile pro ile tyr glu lys arg arg asp phe phe glu asp glu asp gly lys glu tyr 

31SE 3212 

ATT TAC AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 
ile tyr lys glu pro lys leu thr pro leu ser glu ile ser gin arg leu leu lys leu 

32M2 3272 

TAC TCG GAT AAA TTT GGT TCT GAA A AT GTC AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 
tyr ser asp lys phe gly ser glu asn val lys met ile gin asp ser gly lys val asn 

3302 3332 

CCT AAG GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
pr lys asp leu asp ser lys tyr ala tyr ile gin val thr his val ile pro phe phe 

33b2 3312 

GAC GAA AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
asp glu lys glu leu gin glu arg lys thr glu phe glu arg ser his asn ile arg arg 

3M22 3MS2 

TTC ATG TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
phe met phe glu met pro phe thr gin thr gly lys arg gin gly gly val glu glu gin 

3MA2 3S12 

TGC AAA CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
cys lys arg arg thr ile leu thr ala ile his cys phe pro tyr val lys lys arg ile 

Two nucleotide deletion (nts 3Sfib and 3S&7) found in Human CLASP-2C 

1 

3S i,g 3S72 IxXxl 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
pro val met tyr gin his his thr asp leu asn pro ile glu val ala ile asp glu met 



net. z^ctcivt) 



3b02 3L32 

AGT AAG A AG GTG GCG GAG CTC CGG CAG CTG TGC TCC TCG GCC GAG GTG G AC ATG ATC AAA 
ser lys lys val ala glu leu arg gin leu cys ser ser ala glu val asp met ile lys 



3bb2 

CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT 

leu gin leu lys leu gin gly ser val ser 

3722 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA 

ala arg ala phe leu asp asp thr asn thr 

3762 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA 

leu lys glu val phe arg gin phe val glu 

3AMS- 

CGT CTG ATT AAA GAA GAC CAG CTC GAG TAT 

arg leu ile lys glu asp gin leu glu tyr 



3b12 

GTT CAG GTC AAT GCT GGC CCA CTA GCA TAT 
val gin val asn ala gly pro leu ala tyr 

37S2 

AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
lys arg tyr pro asp asn lys val lys leu 

3fll2 

GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
ala cys gly gin ala leu ala val asn glu 

3fl?2 

CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 
gin glu glu met lys ala asn tyr arg glu 



Insertion of A nucleotides found only in Human CLASP- 2D with sequence: CTGGGATG 
3102 3132 I 

ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG ATC TGC CCC CTG GAG GAG AAG ACG 
met ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

3^2 3^12 

AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

MD22 I xxxx PBM xxxx I 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

Moaa mis 

TGG GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 
M1M2 M17H 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 
M202 M232 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 
M2b2 M212 

GCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 

t 

M322 M3S2 

CTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 
M3fi2 MM12 

GTT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 



CCT CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CT A 

M 502 MS3E 

GGA GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 

MSbE MS'IS 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT A AT CCG CTA CTG GCT TCA AGT CAG AAC TTT 

GTC ATT A AT CAT CGA CTC CGG GAC GGT CAT ATA TGT ATT ACA TTT CTA CAT TTT TAA TAC 

MbflE M712 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 

M7ME M77E 

ATA CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 



MflOE 
TTT ACT 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 GCZATCTGGAAATCTTGACAAAAATGCCAGATTTTCTGCCATCTACAGGCAAGACAGCAAT 

HC2E 

HC2F 

HC2A 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA10 5 8 AAGCTATCCAATGATGACATGCTCAAGTTACTTGCAGACTTTCGGAAACCTGAGAAGATG 

HC2E 

HC2F 

HC2A 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 GCTAAGCTCCGAGTGATTTTAGGGAATCTAGACATTACAATTGATAATGTTTCCTCAGAC 

HC2E 

HC2F 

HC2A 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 TTCCCTAATTATGTTAATTCATCATACATTCCCACAAAACAATTTGAAACCTGCAGTAAA 

HC2E 

HC2F 

HC2A 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 ACTCCCATCACGTTTGAAGTGGAGGAATTTGTGCCCTGCATACCAAAACACACTCAGCCT 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 

HC2C 

HC 2 D - KI AA1.0 5 8 TACACCATCT AC^CCAATCACCTTTACGTTTATCCTAAGTACTTGAAATACGACAGTCAG 

HC2E 1 __ 

HC2F 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 AAGTCTTTTGCCAAGGCTAGAAATATTGCGATTTGCATTGAATTCAAAGATTCAGATGAG 

HC2E 

HC2F -~ : 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 GAAGACTCTCAGCCCCTTAAGTGCATTTATGGCAGACCTGGTGGGCCAGTTTTCACAAGA 

HC2E 

HC2F 

HC2A AGTTTTACACCATCACCAAAACCCAGAATTTTATGATGAGATTAAA 

HC2-80 

HC2B 

HC2C 

HC2D - KIAA1 058 AGCGCCTTTGCTGCAGTTTTACACCATC^CCAAAACCC^GAATTTTATGATGAGATTAAA 

HC2E 

HC2F 



HC2A ATAGAGTTGCCCACTCAGCTGCATGAAAAGCACCACCTGTTGCTCACATTCTTCCATGTC 

HC2-80 

HC2B t 

HC2C 

HC2D- KIAA1 058 ATAGAGTTGCCCACTCAGCTGCATGAAAAGCACCACCTGTTGCTCA.CATTCTTCCATGTC 

HC2E 

HC2F 



HC2A AGCTGTGACAACTCAAGTAAAGGAAGCACGAAGAAGAGGGATGTCGTTGAAACCCAAGTT 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 AGCTGTGACAACTCAAGTAAAGGAAGCACGAAGAAGAGGGATGTCGTTGAAACCCAAGTT 

HC2E 

HC2F 



HC2A GGCTACTCCTGGCTTCCCCTCCTGAAAGACGGAAGGGTGGTGACAAGCGAGCAGCACATC 

HC2-80 

HC2B 

HC2C 

HC 2 D - KIAA1 05 8 GGCTACTCCTGGCTTCCCCTCCTG AAAGACGGAAGGGTGGTGACAAGCG AGCAGCACATC 

HC2E 

HC2F 



HC2A CCGGTCTCGGCGAACCTTCCTTCGGGCTATCTTGGCTACCAAGAGCTTGGGATGGGCAGG 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA10 5 8 CCGGTCTCGGCGAACCTTCCTTCGGGCTATCTTGGCTACCAGGAGCTTGGGATGGGCAGG 

HC2E ' 

HC2P 



HC2A CATTATGGTCCGGAAATTAAATGGGTAGATGGAGGCAAGCCACTGCTGAAAATTTCCACT 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 CATTATGGTCCGGAAATTAAATGGGTAGATGGAGGCAAGCCACTGCTGAAAATTTCCACT 

HC2E 

HC2F 



HC2A CATCTGGTTTCTACAGTGTATACTCAGGATCAGCATTTACATAATTTTTTCCAGTACTGT 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 CATCTGGTTTCTACAGTGTATACTCAGGATCAGCATTTACATAATTTTTTCCAGTACTGT 

HC2E 

HC2F 



HC2A CAGAAAACCGAATCTGGAGCCCAAGCCTTAGGAAACGAACTTGTAAAGTACCTTAAGAGT 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 CAGAAAACCGAATCTGGAGCCCAAGCCTTAGGAAACGAACTTGTAAAGTACCTTAAGAGT 

HC2E 

HC2F 



HC2A CTGCATGCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2-80 

HC2B GCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2C -r 

HC2D - KIAA1 058 CTGCATGCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2 E : - - GCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2F 



HC 2 A TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCATT 

HC2-80 

HC2 B TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCATT 

HC2C 

HC2D- KI AA1 0 5 8 TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTC ATT 

HC2E TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTC ATT 

HC2F 

HC2A ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 

HC2-80 

HC2B ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 

HC2C 

HC2D- KIAA1 0 58 ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 

HC2E ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 

HC2F -__ 

HC2A AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 

HC2-80 

HC2B AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 

HC2C , 

HC2D-KIAA1058 AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 

HC2 E AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 

HC2F 

HC2A CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 

HC2-80 

HC2B CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 

HC2C : 

HC2D - KIAA1 058 CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 

HC2E CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 

HC2F 

HC2A CTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTG 

HC2-80 

HC2B CTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTG 

HC2C 

HC2D-KIAA1058 CTACTGAAGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTG 

HC2E CTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATC^UVTCTATGGCTCAGCATTTG 

HC2F 

HC2A ATAGAGAACTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATeCTATCATCAT 

HC2-80 

HC2B ATAGAGAACTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATCCTATCATCAT 

HC2C 

HC2D-KIAA1058 ATAGAGAACTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATCCTATCATCAT 

HC2E ATAGAGAACTCCAAAGTTAAGTTGCTGCG AAACCAGAGATTTCCTGCATCCTATCATCAT 

HC2F 



1 



HC2A GCAGCGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTGGAGATAAT 

HC2-80 

HC2B GCAGCGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTGGAGATAAT 

HC2C 

HC2D-KIAA10 5 8 GCAGTGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTCGAGATAAT 

HC2E GCAGCGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTGGAGATAAT 

HC2F 

HC2A , CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2-80 

HC2B CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2C 

HC2D- KIAA1 05 8 CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2E CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2F 

HC2A TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

HC2-80 . 

HC2B TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

HC2C 

HC2 D - KIAA1 05 8 TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

HC2E TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

HC2F -- 

HC2A GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

HC2-80 

HC2B GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

HC2C 

HC2D- KIAAX 058 GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

HC2E GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

HC2F 

HC2A GAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 

HC2-8 0 . 

HC2B GAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 

HC2C 

HC2D- KIAA1 05 8 GAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 

HC2E GAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 

HC2F 

HC2A CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTGTGCAGAAACCACTTCTTGGTG 

HC2-80 TCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

HC2B CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

HC2C 

HC2D- KIAA1 058 CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

HC2E CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

HC2F 



HC 2 A CC ATATACAACCTC AACTCCAAAC ATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 

HC2 - 8 0 CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 

HC2B CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 

HC2C 

HC2D- KIAA1 05 8 CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 

HC2E CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 

HC2F GCTGATTCGAGAGGATCT 

HC2A CTCATAAGGACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2-80 CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2 B CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2C 

HC2D- KIAA1 0 58 CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2 E CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2F CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

HC2A CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 

HC2-80 CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 

HC2B CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 

HC2C 

HC2D - KIAA1 05 8 CTGGAT AAGCAC CAACAAAGT AGCAC ATTGGGAAATT CCGTGGTTCGCTGTGATAAACTT 

HC2E CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 

HC2F CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 

HC2A GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 

HC2B ' GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 

HC2C 

HC2E GACCAGTCTGAGATTAAGAGGCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 

HG2F GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 

HC2A GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

HC2-80 GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT. 

HC2B GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

HC2C 

HC2D- KIAA10 5 8 GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

HC2E GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGAT"rTT , rTT.' 

HC2 F GATGATGCTTTGTTTAGATATTGGAACAAGGCTTGAACATCTGAACTTATGGATTTTTTT 

HC2A ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 

HC2 - 8 0 ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 

HC2B ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 

HC2C 

HC2D - KIAA1 058 ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAG- 

HC2E ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 

HC2F ACAATATCTGAAGTCTGCCTGCAC CAGTTCCAGTACATGGGGAAGCGAT ACATAGCCAG - 



T 

HC2A AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 

HC2 - 80 AAC CAGG AGGGGTTGGG AC C CAT AGTTC ATG ATCG AAAGTCTCAG ACATTGC CTGTTT C C 

HC2B AACCAGGAGGGGTTGGG ACCCATAGTTCATGATCGAAAGTCTCAGAC ATTGCCTGTTTCC 

HC2C 

HC2D-KIAA1058 t AA 

HC2E AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 

HC2F TGTGA GAAAG ATATCAAGTGT 

HC2 A CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 

HC2 - 8 0 CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 

HC2B CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 

HC2C 

HC2D- KIAA1 058 C AGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGG ATAACTCT 

HC2E CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 

HC2F GCTTGGAA 

HC2A CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 

HC2 - 8 0 CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 

HC2B CTCACTTTTAACCACAGCTATGGCCA.CTCGGACGCAGATGTTCTGCACCAGTCATTACTT 

HC2C 

HC2D -KIAA1 0 58 CTTCACTTTTAACCACAGCTATGGCGACTCGGACGCAGATGTTCTGCACCA.GTCATTACTT 

HC2E CTCACTTTTAACCIACAGCTATGGCCACTCGGACGCIAGATGTTCTGGACCAGTCATTACTT 

HC2F - TTTCTGTAGACAATGGCTATGGCC^CTCGGACGCAGATGTTCTGCACCAGTCATTACTT 

HC2A GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACA.CGCTTTCTCTATTTACA 

HC2 - 8 0 GAAGCCAACATTGCrrACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 

HC2B GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 

HC2C " 

HC2D- KIAA1 058 GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 

HC2E GAAGCCAAC^TTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 

HC2F GAAGCCAACATTGCrTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 

HC2A TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 

HC2 - 8 0 TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 

HC2B TTGGCGTTTAAG CTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 

HC2C 

HC2D-KIAA1058 TTGGCGTTTAAGAACCAGCTCCTrGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 

HC2F TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAAAA 

HC2 A TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 

HC2-80 TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACG^CTTTAAAAAATGTC 

HC2B TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACG^CTTTAAAAAATGTC 

HC2C 

HC2D-KIAA1058 TTTGATGTCrACCTGTGTT^TTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 

HC2E TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACG^CTTTAAAAAATGTC 

HC2F A r- : 



1 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 
TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 
TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 

TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 
TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTGC 



GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 

ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
" ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 
GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 
GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 

GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 
GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 
GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 
GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 

GCAGACGTTGTTGGCATTGGGGGAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 
GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 

TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTGTCCTCTGATGTGAAGGACTTA 
TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 



HC2A ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2 - 8 0 ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2B ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2C 

HC2 D - KIAA1 05 8 ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2E ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATG AGAACGAC 

HC2F 

HC2A CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

HC2 - 8 0 CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

HC2B CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

HC2C 

HC2D - KIAA10 5 8 CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

HC2E . CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

HC2F 

HC2A CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

HC2-80 CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

HC2B CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

HC2C 

HC2D- KIAA10 5 8 CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

HC2 E CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

HC2F 

HC2 A GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

HC2 - 8 0 GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

HC2B GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

HC2C 

HC2D- KIAA10 5 8 GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

HC2 E GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

HC2F 

HC2A G 

HC2-80 G 

HC2B G 

HC2C 

HC2D- KIAA1 058 GAAGCAGTCCAGTGGGAGCCGCCCCTTCTCCCCCACAGCCATAGCGCCTGCCTGAGGAGG 

HC2E G 

HC2F 

HC2A GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTAGCCCAAACATC 

HC2 - 80 GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2B GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2C GTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2D- KIAA1 058 AGCCGGGGAGGCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2E GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2F 



HC2A GACGAGGAGGCCTCCATGATGGAAGACGTG43GGATGCAGGATGTCCATTTCAACGAGGAT 

HC2 - 8 0 GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGGAGGATGTCCATTTCAACGAGGAT 

HC2B GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 

HC2C GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 

HC2D-KIAA1058 G ACG AGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTC AACGAGGAT 

HC2E GACGAGGAGGCCTCCATGATGGAAGACGTGGGGA 

HC2F 

HC2A GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 

HC2 - 8 0 GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 

HC2 B GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 

HC2C GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 

HC2D- KI AA10 5 8 GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 

HC2E AAGCCGAGCGCTACG AG 

HC2F 

HC2 A CTCATCGC CG AC ATCT AC AAACTTAT CATCCC CATTT ATG AG AAGC GG AGGGATTT 

HC2 - 8 0 CTCATCGC CGACIATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTT 

HC2 B CTCATCGCCGACATCTAC^AACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 

HC2C CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 

HC2D - KIAA1 058 CTCATTGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 

HC2 E CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 

HC2F 

HC2A 

HC2-80 

HC2B AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 

HC2C AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 

HC2D- KIAA1 058 AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 

HC2E AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 

HC2F 

HC2A 

HC2-80 

HC2B CACTCGGGCCGC^GGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 

HC2 C CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 

HC2E CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 

HC2F 

HC2A CTTTGAAGATGAAGATGGA 

HC2 _aO CTTTGAAGATGAAGATGGA 

HC2B GATTCTTTGAAGATGAAGATGGA 

HC2C GATTCTTTGAAGATGAAGATGGA 

HC2D- KIAA1 058 CAATACCAGTTTACAGACAGTGAAACAGATGTGGAGGGATTCTTTGAAGATGAAGATGGA 

HC2E GATTCTTTGAAGATGAAGATGGA 

HC2F '-" 



HC2A 
HC2-80 
HC2B 
HC2G 

HC2D-KIAA1058 

HC2E 

HC2F 



AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGACACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTGAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCCTACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 



HC2A 
HC2 - 80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



G AAGAGCAGTGCAAACGGCGCAC CATC CTGACAGC C AT ACACTGCTT CCCTTATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 



GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAAX058 

HC2E 

HC2F 



AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTTTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGT - - CCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA105 8 

HC2E 

HC2F 



GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA10 58 

HC2E 

HC2F 



CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 



HC2A 
HC2-8 0 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGGG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA10 58 

HC2E 

HC2F 



GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 



V 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E " 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TAC AGGGAAATGGCGAAGG AGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TAC AGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAGCTGGGATGATCTGCC 
TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 



CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 



GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 
GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 

GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGA 

GGACTCCAACAAGCACAATGGTTC^CGGGATGACCAGCTCGTCTTCGGTCGTGTGA 

GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 
GGACTCCAACAAGCACAATGGTTC7VCGGGAT/GACCA.GCTCGTCTTCGGTCGTGTGA 



ATCTCATGGCCCGTGTGTGGGGACTTGCTTTGTCATTTGCAAACTCAGGATGCTTTCCAA 
ATCTCATGGCCCGTGTGTGGGGACTTGCTTTGTCATTTGCAAACTCAGGATGCTTTCCAA 

ATCTCATGGCCCGTGTGTGGGGACTTGCTTTGTCATTTGCAAACTCAGGATGCTTTCCAA 



AGCCAATCACTGGGGAGACCGAGCACAGGGAGGACCAAGGGGAAGGGGAGAGAAAGGAAA 
AGCCAATCACTGGGGAGACCGAGCACAGGGAGGACCAAGGGGAAGGGGAGAGAAAGGAAA 

AGC CAAT CACTGGGG AG ACCG AGCACAGGG AGG AC CA - GGGGAAGGGGAGAGAAAGGAAA 



TAAAGAACAACGTTATTTCTTAACAGACTTTCTATAGGAGTTGTAAGAAGGTGCACATAT 
TAAAGAACAACG1TATTTCTTAACAGACTTTCTATAGGAGTTGTAAGAAGGTGCACATAT 

TAAAGAACAACGTTATTTCTTAACAGACTTTCTATAGGAGTTGTAAGAAGGTGCACATAT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TTTTTTAAATCTCACTGGCAATATTCAAAGTTTTCATTGTGTCTTAACAAAGGTGTGGTA 
TTTTTTAAATCTCACTGGCAATATTCAAAGTTTTCATTGTGTCTTAACAAAGGTGTGGTA 



TTTTTTAAATCTCACTGGCAATATTCAAAGTTTTCATTGTGTCTTAACAAAGGTGTGGTA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA105 8 

HC2E 

HC2F 



GACACTCTTGAGCTGGACTTAGATTTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATG 
GACACTCTTGAGCTGGACTTAGATTTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATG 



GACACTCTTGAGCTGGACTTAGATTTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATG 



HC2A GCCTACAGAAAAAAAAGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATTGAT 

HC2 - 8 0 GCCTACAGAAAAAAAAGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATTGAT 

HC2B .-- 

HC2C 

HC2D- KIAA1 05 8 GCCTACAGAAAAAAAAGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATTGAT 

HC2E : 

HC2F 

HC2 A GCCTGGGGGACCTTTTGCCTCGACTCGTGCCGGAAATCTGATCGTAATCAGGGTACAGAA 

HC2 - 8 0 GCCTGGGGGACCTTTTGCCTCGACTCGTGCCGGAAATCTGATCGTAATCAGGGTACAGAA 

HC2B 

HC2C 

HC2D- KIAA1 058 GCCTGGGGGACCTTTTGCCTCGAGGCTGAGCTGGAAAATCTTGAAAATATTTTTT T 

HC2E 

HC2F 



HC2A CTTACTAGTTTTGTCTAGGAGTATGTTGTATGACTAGGATTTGTGCTATTATCTCATTCA 

HC2-80 CTTACTAGTTTTGTCTAGGAGTATGTTGTATGACTAGGATTTGTGCTATTATCTCATTCA 

HC2B 

HC2C 

HC2D-KIAA1058 TTTCCTGTGGCACATTCAGGTTGAATACAAGAACTATTTTTGTGACTAGTTTTTGATGAC 

HC2E 

HC2F 

HC2A ACAACATAGAGCAAGAATAGTGAGCTAACTGAGCTAGACACTCAATTAATCCGCTACTGG 

HC2 - 8 0 ACAACATAGAGCAAGAATAGTGAGCTAACTGAGCTAGACACTCAATTAATCCGCTACTGG 

HC2B 

HC2C 

HC2D-KIAA1058 CTAAGGGAACTGACCATTGTAATTTTTGTACCAGTGAACCAGGAGATTTAGTGCTTTTAT 

HC2E 

HC2F 



HC2A CTTCAAGTCAGAACTTTGTCATTAATCATCGACTCCGGGACGGTCATATATGTATTACAT 

HC2 - 8 0 CTTCAAGTCAGAACTTTGTCATTAATCATCGACTCCGGGACGGTCATATATGTATTACAT 

HC2B 

HC2C 

HC2D-KIAA10 58 ATTCATTTCCTTGCATTTAAGAAAATATGAAAGCTTAAGGAATTATGTGAGCTTAAAACT 

HC2E r- 

HC2F 

HC2A TTCTACATTTTTAATACTCACATGGGCTTATGCATTAAGTTTAATTGTGATAAATTTGTG 

HC 2 - 8 0 TTCT ACATTTTTAATACTC AC ATGGGCTTATGCATTAAGTTTAATTGTG ATAAATTTGTG 

HC2B 

HC2C 

HC2D-KIAA1058 AGTCAAGCAGTTTAGAACC AAAGGCCTATATT AAT AACCGCAACTATGCTGAAAAGTAC A 

HC2E 

HC2F 



HC2A CTGGTCCAGTATATGCAATACACTTTAATGGTTTATTCTTGTCATAAAAATGTGCAATAT 

HC2 - 8 0 CTGGTCCAGTATATGCAATACACTTTAATGGTTTATTCTTGTCATAAAAATGTGCAATAT 

HC2B 

HC2C 

HC2D- KIAA1 058 AAGTAGTACAGTATATTGTTATGTACATATCATTGTTAATACAGTCCTGGCATTCTGTAC 

HC2E 

HC2F 

HC2A GGAGATGTATACAAGTCTTTACT 

HC2 - 8 0 GGAGATGTATACAAGTCTTTACT - : 

HC2B 

HC2C 

HC2D-KIAA1058 ATATATGTATTACATTTCTACATTTTTAATACTCACATGGGCTTATGCATTAAGTTTAAT 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 TGTGATAAATTTGTGCTGTTCCAGTATATGCAATACACTTTAATGTTTTATTCTTGTACA 

HC2E 

HC2F 



HC2 A ■■ 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 TAAAAATGTGCAATATGGAGATGTATACAGTCTTTACTATATTAGGTTTATAAACAGTTT 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 



HC2D- KIAA10 5 8 TAAGAATTTCATCCTTTTGCCAAAATGGTGGAGTATGTAATTGGTAAATCATAAATCCTG 

HC2E " 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 . 

HC2E 

HC2F 



ATTTTC TTTT AAG ACAT T AA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TTTAGTAATTTTATATTTGGGAAAATAAAGGTTTTTAATTTTATTTAACTGGAATCACTG 



HC2A 

HC2-80 

HC2B " ------ 

HC2D- KIAAIO 5 8 CCCTGCTGTAATTAAACATTCTGTACCACATCTGTATTAAAAAGACATTGCTGACC 

HC2E 

HC2F : 



HC2A 

HC2A-8 0 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B . 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



ASGNLDKNARFSAIYRQDSNKLSNDDMLKLLADFRKPEKMAKLPVILGNLDITIDNVSSD 

FPNYVNSS YI PTKQFETCSKTP I TFEVEE FVPC I PKHTQPYT I YTNHLYVYPKYLKYDSQ 

VLHHHQNPE FYDE I K 

KS FAKARN I AI C I E FKDS DEEDS QPLKC I YGRPGGPVFTRSAFAAVLHHHQNPE FYDE I K 

IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLLKDGRVVTSEQHI 
IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 

PVSANLPSGYLGYQELGMGPJTfGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC . 
PVSANLPSGYLGYQELGMGRHYGPEIKVJVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLTRATQEEVA\TNVTRVI 

AMEGHVMIAFLPTILNQLFRVLTRATQEEVAVNVTRVI 

QKTES GAQALGNELVKYLKS LHAMEGHVM I AFL PT I LNQL FRVLTRAT QEEVAVNVTRV I 
AMEGHVM I AFLPT I LNQLFRVLTRAT QEEVAVNVTRV I 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-8 0 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



IHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 



IHVVAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 



I HWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTT ILKPSADFLTSNK 
IHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 



LLRYSWFFFDVL I KSMAQHLI ENSKVKLLRNQRFPAS YHHAAETVVNMLMPH I TQKFGDN 



LLRYSWFFFDVL I KSMAQHL I ENSKVKLLRNQRFPAS YHHAAETVVNMLMPH I TQKFGDN 



LLKYSWFFFDVX I KSMAQHLI ENSKVKLLRNQRFPAS YHHAVETWNMLMPH I TQKFRDN 
LLRYSW FFFDVL I KSMAQHL I ENSKVKLLRNQRFPAS YHHAAET WNMLMPH I TQKFGDN 



PEASKNANHSLAVFIKRCFTFMDRGFVFKQINNYISCFAPGDPKTLFEYKFEFLRWCNH 

PEASKNMraSLAVFIKRCFTFMDRGFVFKQINNYISCFAPGDPKTLFEYKFEFLRVVCNH 

PEASKNANHSLAVFIKRCFT FMDRGFVFKQINNY I SCFAPGDPKTLFE YKFEFLRWCNH 
PEASKNANHSLAVFIKRCFTFMDRGFVFKQINNY I SCFAPGDPKTLFEYKFEFLRWCNH 



EHY I PLNLPMP FGKGR I QRYQDLQLDYS LTDE FCRNH FLVGLLLREVGTALQE FREVRL I 

_ QLDYSLTDEFCRNH FLVGLLLREVGTALQE FREVRL I 

EHY I PLNLPMP FGKGR I QRYQDLQLDYS LTDE FCRNH FLVGLLLREVGTALQE FREVRL I 

EHYI PLNLPMP FGKGR I QRYQDLQLDYS LTDE FCRNH FLVGLLLREVGTALQE FREVRL I 
EH Y I PLNLPMP FGKGR I QRYQDLQLDYS LTDE FCRNH FLVGLLLREVGTALQE FREVRL I 



AISVLKNLLIKHSFDDRYASRSHQARIATLYLPLFGLLIENVQRINVRDVSPFPVNAGMT 
AI S VLKNLL IKHS FDDRYASRSHQARIATLYLPLFGLL I ENVQR I NVRDVS P FPVNAGMT 
AI SVLKNLLIKHS FDDRYASRSHQARIATLYLPLFGLLIENVQRINVRDVSPFPVNAGMT 



AI S VLKNLL I KHS FDDRYASRSHQAR I ATLY LPLFGLL I ENVQR INVRDVS P FPVNAGMT 
AI S VLKNLL I KHS FDDR YASRS HQARI AT L YLPLFGLL I ENVQR INVRDVS P FPVNAGMT 



VKDESLALPAVNPLVTPQKGSTLDNSLHKDLLGAISGIASPYTTSTPNINSVRNADSRGS 
VKDES LALPAVNPLVT PQKGSTLDNS LHKDLLGAI SG I AS P YTTS TPN INS VRNADSRGS 
VKDE S LAL P AVNPL VT P QKGS T LDNS LHKD L LGAI S G IAS PYTTSTPNINS VRNADS RG S 

VKDES LALPAVNPLVT PQKGS TLDNS LHKDLLGAI SG IAS PYT TSTPN INSVRNADSRGS 
VKDES LALPAVNPLVT PQKGSTLDNSLHKDLLGAISGIASPYTTSTPNINSVRNADSRGS 
— " ADSRGS 



» 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



LISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSEIKSLLMCFLYILKSMS 
LISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSEIKSLLMCFLYILKSMS 
LISTDSGNSLPE RNS EKS NS L DKHQQS S T LGNSWRC DKLDQS E I KS LLMC FLY I LKSMS 



LISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSEIKSLLMCFLYILK5MS 
LISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSEIKSLLMCFLYILKSMS 
LISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSEIKSLLMCFLYILKSMS 



DDAL FT YWNKAS T S E LMD F FT I SE VC LHQ FQYMGKR Y I ARNQEGLG P I VH DRKS QT L P VS 
DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 
DDAiFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 



DDAL FT YWNKAS T S E LMD FFT I S E VC LHQFQ YMGKRY I AR . 

DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 
DDALFTYWNKASTSELMDFFT I SEVCLHQFQ YMGKRY IAS VR — KISSVLGIS 



RNRTGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 
RNRTGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 
RNRTGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 



TGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 

RNRTGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 
v D-NG YGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 



IJVFKNQLIADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIYKFPSTFYEGRA 
LAFKNQLLADHGHNPLMKKVFDVYLC FLQKHQSETALKNVFTALRS L I YKFPST FYEGRA 
LAFK — LIjADHGHNPIjMKKVFDVYLCFLQKHQSETALKNVFTALRSLIYKFPSTFYEGRA 



IAFKNQLLADHGHNP LMKKVFDVYLCFLQKHQSETALKNV FT ALRSLI YKFPST FYEGRA 
lAFKNQLlJVDHGHNPIaMKKVFT)WLCFLQKHQSETALKNVFTALRSLIYKFPSTFYEGRA 
LAFKNQLLADHGHNPLMKKK 



DMCAALC YE I LKCCNSKLSS I RTEAS QLLY FLMRNN FDYTGKKS FVRTHLQV 1 1 S VS QL I 
DMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTHLQVIISVSQLI 
DMCAALCYE I LKCCNSKLSS IRTEASQLLYFLMRNN FDYTGKKS FVRTHLQVI ISVSQLI 



DMCAALCYE I LKCCNSKLSS IRTEASQLLYFLMRNN FDYTGKKS FVRTHLQVI ISVSQLI 
EMCAALCYEILKCCNSKI^SIRTEASQLLYFLMRNNFDYTGKKSFVRTHLQVIISVSQLI 



ADWGIGETRFQQSLSI INNCANSDRLIKHTSFSSDVKDLTKRIRTVLMATAQMKEHEND 
ADWGIGETRFQQS LS I INNCANSDRLIKHTS FSS DVKDLTKRIRTVLMATAQMKEHEND 
ADVVGIGETRFQQSI^IINNGANSDRLIKHTSFSSDVKDLTKRIRTVLMATAQMKEHEND 



ADWG IGGTRFQQS LS I INNCANS DRL I KHT S FSS DVKDLTKRI RTVLMAT AQMKE HEND 
ADWG I GE TR FQQS LS 1 1 NNCANS DRL I KHT S FS S DVKDLTKRI RTVLMATA134I^EHEND 



fid. 3B>cc^-.y 



HC2 A PEMLVDLQYS LAKS YAS TPELRKT WLDSMAR I HVKNGDLSEAAMC YVHVTALVAE YL TRK 

HC2A- 8 0 PEIMIjVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGDLSEAAMCYVHVTALVAEYLTP^ 

HC2 B PEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGDLSEAAMCYVHVTALVAEYLTRK 

HC2C 

HC2 D PEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGDI^EAAMCYVWTALVAEYLTRK 

HC 2E PEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGDLSEAAMCYVHVTALVAEYLTRK 

HC2F 



hc2 a GVFRQGC TAFRV I T PN I DE EASMME DVGMQDVH FNE 

HC2A- 8 0 : GVFRQGCTAFRVI TPNIDEEASMMEDVGMQDVHFNE 

hc2 b GVFRQGCTAFRVI TPNIDEEASMMEDVGMQDVHFNE 

HC2 c FRQGCTAFRVI TPN I DEEASMMEDVGMQDVHFNE 

HC 2D EAVQWEPPLLPHSHSACLRRSRGGVFRQGCTAFRVITPNIDEEASMMEDVGMQDVHFNE 

HC2 E GVFRQGCTAFRVI TPN I DEEASMMEDVG 

HC2F 



HC2A DVLME LLE QCADG LWKAER YEL IAD I YKL 1 1 PIYEKRR 

HC2A-80 DVXMELLEQCADGLWKAERYELIADiYKLIIPIYEKRR 

HC 2B DVLMELLEQCADGLWKAERYELIADI YKLI I PIYEKRPJDFERLAHLYDTLHRAYSK 

HC 2c DVXMELLEQCADGLWKAERYELIADIYKLIIPIYEKPJU5FERLAHLYDTLHRAYSK 

HC2D DVLMELLEQCADGLWKAERYELIADI YKLI IPIYEKRRDFEKLAHIiYDTLHRAYSK 

HC2E " KAERYELIADI YKLI IPI YEKRRDFERLAHLYDTLHRAYSK 

HC2F — 



HC2A ■ ■ -DFFEDEDGKEYIYKEPKIjTPL.se 

HC2A-80 ; '■ DFFEDEDGKEYIYKEPKLTPLSE 

HC2B VTEVMHSGRRLLGTYFRVAFFGQ GFFEDEDGKEYIYKEPKLTPLSE 

Hc2 c VTEVMHSGRRLLGTYFRVAFFGQ GFFEDEDGKEY I YKEPKLTPLSE 

HC2D VTEVMHSGPJUXGTYFRVAFFGQAAQYQFTDSETDVEGFFEDEDGKEYIYKEPKLTPLSE 

HC2 E VTEVMHSGRRLLGTYFRVAFFGQ GFFEDEDGKEYIYKEPKLTPLSE . 

HC2F 



HC 2A I S QRLLKL YS DKFGS ENVKMI QDS GKVN PKDLDSKYAY I QVTHVI P FFDEKELQERKTE F 

HC2A-80 ISQRLLKLYSDKFGSENVKMIQDSGKVNPKDLDSKYAYIQVTHVIPFFDEKELQERKTEF 

HC2B I S QRLLKL YS DKFGS ENVKMI QDS GKVNPKDLDSKYAY I QVTHVI P FFDEKELQERKTE F 

HC2C ISQPOjLKLYSDKFGSENVKMTQDSGKVNPKDLDSKYAYIQVTHVIPFFDEKELQERKTEF 

HC 2D ISQRLLKLYSDKFGSENVKMIQDSGKVNPKDLDSKYAYIQVTHVIPFFDEKELQERKTEF 

HC 2E ISQRLLKLYSDKFGSFJJVKMIQDSGKVNPKDLDSKYAYIQVTHVIPFFDEKELQERKTEF 

HC2F 



HC2 A ERSHNIRRFMFEMPFTQTGKRQGGVEEQCKRRTILTAIHCFPYVKKRIFVMYQHHTDLNP 

HC2A- 8 0 ERSHN IRRFMFEMP FTQTGKRQGGVEEQCKRRT I LTAIHCFPYVKKRI PVMYQHHTDLNP 

HC2 B ERSHNIRRFMFEMPFTQTGKRQ^GVEEQCKRRTI LTAIHCFPYVKKRI PVMYQHHTDLNP 

HC 2C ERSHNIPxRFMFFJ4PFTQTGKRQGGVEEQCKRRTILTAIHCFPYVKKRIPFMYQHHTDI^ 

HC2 D ERSHNIRRFMFFJ4PFTQTGKRQGGVEEQCKRRTILTAIHCFPYVK3CRI PVMYQHHTDLNP 

HC2E ERS HN I RR FMFEMP FT QTGKRQGGVEE QCKRRT I LTAIHC FP YVKKRI PVMYQHHTDLNP 

HC2F 



HC2A IEVAIDEMSKKVAELRQLCSSAEVDMIKLQLKLQGSVSVQVNAGPIAYARAFLDDTNTKR 

HC2A- 8 0 IEVAIDEMSKKVAELRQLCSSAEVDMIKLQLKLQGSVSVQVNAGPLAYARAFLDDTNTKR 

HC2B I E VAI DEMS KKVAELRQLCSSAE VDMI KLQLKLQGSVSVQVNAGPLAYARAFIjDDTNTKR 

HC2C IEVHZ 

HC2D IE VAI DEMSKKVAELRQLCSSAEVmiKLQLKLQGSVSVQVNAGPLAYARAFLDDTNTKR 

HC2E IEVAI DEMS KKVAELRQLCSS AEVDM I KLQLKLQGS VSVQVNAGPLAYARAFLDDTNTKR 

HC2F 



HC2A YPDNKVKLLKEVFRQFVEACGQALAVNERLIKEDQLEYQEEMKANYREMAKELSEIMHEQ 

HC2A- 8 0 Y PDNKVKLLKE VFRQ FVEACGQALAVNERL I KEDQLE YQEEMKANYKEMAKELSE IMHEQ 

HC2B YPDNKVKLLKEVFRQ FVEACGQALAVNERL I KEDQLEYQEEMKANYREMAKELSEIMHEQ 

HC2C 

HC2D YPDNKVKLLKEVFRQFVFACGQALAVNERLI KEDQLEYQEEMKANYREMAKELSE IMHEQ 

HC2E YPDNKVKLLKEVFRQFVEACGQAIAVNERLIKEIX2LEYQEEMKANYREMAKELSEIMHEQ 

HC2F ; 

HC2A ICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSVVZ 

HC2A-80 ICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSWZ 

HC2B ICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSWZ 

HC2C : ^ 

HC2D LG 

HC2E ICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSWZ 

HC2F 
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HC2A > 

KIAA AS GNLDKNARFSAI YRQDSNKLSNDDMiKLLADFRKPEKMAKLPVI LGNLD I T I DNVS S D 

rat — ' : 

HC4 • 

HC1 

HC3 : 

HC5 ■■ 



Ht-^A 

KIAA FPNYVNS S YI PTKQFETCSKT PIT FEVEE FVPC I PKHTQPYT I YTNHLYVYPKYLKYDSQ 

rat = 

HC4 ~ < 

HC1 ■ 

HC3 ■ 

HC5 

HC2A VLHHHQNPEFYDEIK 

KIAA KS FAKARNIAI CIEFKDS DEEDSQPLKCI YGRPGGPVFTRSAFAAVLHHHQNPEFYDE I K 

rat : 

HC4 

HC1 ■ 

HG3 1 

HC5 ■■ 



HC2A IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLLKDGRVVTSEQHI 

KIAA IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLLKDGRVVTSEQHI 

rat : . ■ 

HC4 

HC1 

HC3 

HC5 



HC2A PVSANLPSGYLGYQELGMGRHYGPEIKWVTCGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

KIAA PVSANLPSGYLGYQELGMGRHYGPE IKWVDGGKPLLKI S THLVS TVYTQDQHLHNFFQYC 

rat :- : 

HC4 ~ 

HC1 ■ 

HC3 : GPGPARS TVS I S L I SNSARV 

HC5 

HC2A QKTESGAQALGNELVKYLKSLHAMEGHVMIAFTjPTILNQLFRVLT-RATQEEVAV^ 

KIAA QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 

rat — • 

HC4 MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV 

HC1 MS FL P 1 1 LNQL FKVLV- QNEEDE ITT TVTRV 

HC3 NRSRSLSNSNPDI SGT PTS PDDEVRS I IGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 

HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



hee^eesh 



IIHWAQCHEEfelXSHLRSYVKYAYKAEPYVASEYKTVHEELTKiiMl'TILKPSADFLTSN 
1 1 HWAQCHEEGLES HLRS YVKYAYKAE P YVASE YKTVHEELTKSMTT I LKPS ADFLT SN 



LFHIVSKCHEEGLDS YLSS FIKYS FRPGKPSAPQAPLIHETLATMMIALLKQSADFLAIN 

LPD I VAKCHEEQLDHS VQS Y I KFVFKTR ACKERPVHEDLAKNVTGLLK-SNDSPTVK 

TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKLFHEELALQWWCSG — SVR E 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



Cleava 

KLLR Y S W FFFDVL I KSMAQHL I ENS KVKLLfeNQP 
KLLKYSWFFFDVLIKSMAQHLIENSKVKLLRNQF 


ge 

FPASYHHAAETWNMLMPHI TQKFGD 
FPAS YHHAVETVVNMLMPH ITQKFRD 


KLLKYSWFFFE 1 1 AKSMAT YLLEENKI KLT 
HVLKHSWFFFAI I LKSMAQHLI DTNKI QLP 
SALQQAWFFFELMVKSMVHHLYFNDKLEAP 


iGQB 
S.PQR 
*KSR 


FPKAYHHALHSLFLAIT- IVESQYAE 
FPESYQNELDNLVMVLSDHVIWKYKD 
FPERFMDDIAALVS T IAS DI VSRFQK 
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HC2A CRNHFLVGLLLREVGTALQEFRE VRLIAI SVLKNLLIKHS FDDRYASRSHQARIAT 

KIAA CRNHFLVGLLLREVGTALQEFRE VRLIAI SVLKNLLIKHS FDDRYASRSHQARIAT 

rat • — 

HC4 CKHHFLVGLLLRETS IALQDNYE IRYTAI SVIKNLLrKHAFDTRYQHKNQQAKIAQ 

HC1 CRKHFL I G I LLREVG FALQEDQD VRHLALAVLKNLMAKHS FDDRYREPRKQAQ IAS 

HC3 RQQHYLAGLVLTELAVI LDPDAEGLFGLHKKVI NMVHNLLS SHDS DPRYS DPQIKARVAM 

HC5 SSTS-SPGLLFTELAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 

HC2A LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

KIAA LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

rat 7 : 

HC4 LYLPFVGLLLENIQRLAGRDTLYS CAAMPNSASRDE FPCG FTSP — AN — RGSLS 

HC1 L YMPLYGMLLDNMPRI YLKDLYPFTVNTSNQGSRDDLS TNGGFQS QTAI KHANS VDTS FS 

HC3 LYLPLIGI IMETVPQLYDFTETHNQRGRP ICIATDDYESE SG SMI S 

HC5 LYLPLVGI ILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 



HC2A KDLLGAISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 

KIAA KDLLGAISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 

rat ; 

HC4 TDKDTAYGS FQNG HGIKREDSRGSLI P-EGATGFPDQGNTGEN TRQS 

HC1 KDVLNS IAAFSS- IAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPL 

HC3 QTVAMAIAGTSVPQ LTRPGS FLLTSTSGRQHT — 

HC5 QNVALAIAGNNFN : LKTSG- 1 VLSSLPYKQYN 
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TLGNSWRCDKLDQSEIKSLLMCFLYILKSMSDDALFTyWN-KASTSEIiMDFFTISEVCL 
TLGNSVVRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 



STRSSVSQYNRLDQYEIRSLLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 
AIjIGSTLRFDRLDQAETRSLLMCFLHIMKTISYETLIAYWQ-RAPSPEVSDFFSILDVCL 

: TFSAESSRSLL i cllwvlkn- adetvlqkw ft dl s vlqlnrlldll ylcv 

MLNADTTRNLM I C FLW IMKN- ADQS L I RKW I ADLP S TQLNRI LDLLFI CV 



HQFQYMGKRYIARNQEGLG — PIVHDRKS QTLPVSRNRTGMM 

HQFQYMGKRYIAR TGMM 



FHFRYMGKRNIARVHDAWLSKHFGIDRKS-- QTMPALRNRSGVM 

QNFRYLGKRNI IRKIAAAF — KFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSKHEGHK 

SCFEYKGKKVFERMNSLTFK — KSKDMRAK LEEAI LG S I GARQEMV 

LC FE YKGKQS S DKVS TQVLQ — KSRDVKAR LEEALLRGEGARGEMM 



HARLQQL GSLDNS L T FNHS YGHS DADVLHQS LLEANI ATEVC 

HARLQQL GSLDNS LT FNHS YGHS DADVLHQS LLEANI ATEVC 



QARLQHL SSLESS FTLNHSSTTTEADI FHQALLEGNTATEVS 

QHRSQTLPI IRGK — — NALSNPKL LQMLDNTMTSNSNEIDIVHHVDTEANIATEGC 

RRSRGQLERS PS GS AFGSQENLRWRKDMTHWRQNTEKLDKSRAE I EHEAL I DGNLATEAN 
RRRAPGNDRFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALI SGNLATEAH 



LTALDTLSLFTIAFKNQLLADHGHNPU4KKVFDVYLCFLQKHQSETALKNV^ 
LTALDTLSLFTIiAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 

KL S RGHS PLMKKVFDVYLC FLQKHQS EMALKNVFTALRS L I Y 

LTVLDTISFFTQCFKTHFIjNNDGHNPLMKKVFDIHIAFLKNGQSEVSLkHVFASLRAFIS 
LTILDLVSLFTQTHQRQLQQCDCQNSLblKRGFDTYMLFFQVNQSATALKHVFASLRLFVC 
LIILDTLEIWQTVS — VTES — KESILGGVLKVLLHSMACNQSAVYLQHCFATQRALVS 
LI ILDMQENI I.QASS — ALDC — KDSLLGGVLRVLVNS LNCDQS TT YLTHC FATLRALIA 



KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 
KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 
KFPSTFYEGRADMCASLCYEVLKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 
KFPSAFFKGRVNMCAAFCYEVLKCCTSKISSTRNEASALLYLLMRNNFEYTICRKTFLRTH 
KFPSAFFQGPADLCGS FCYEVLKCCNHRSRS TQTEASALLYLFMRKNFEFNKQKS IVRSH 
KFPELLFEEETEQCADLCLRLLRHCS SSI GT IRSHPSASLYLLMRQNFE IGN — NFARVK 
KFGDLL FEEEVEQC FDLCHQVLHHCS S SMDVTRS QACATL YLLMRFS FGAT S — N FARVK 



LQVIISVSQLIADWGIGETRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 
LQVIISVSQLIADWGIGGTRFQQSLSIINNCANSDPJilKHTSFSSDVKDLTKRIRTVLM 
LQVI ISLSQLIADWGIGGTRFQQSLS I INNCANSDRLIKHTS FSSDVKDLTKRIRTVLM 
LQI I IAVSQLIADVALSGGSRFQESLFI INNFANSDRPMLARAFPAEVKDLTKRI RTVLM 
LQLIKAVSQLIAD-AGIGGSRFQHSIJU:TNNFANGDK(31KNSNFPAEVKDLTKRIRTVLM 
MQVPMSLSSLVGTSQNFireEFLRRSLKTILTYAEEDLEIiPXTTFPDQVQDLVFNLHMILS 
MQNnMSLASLVGRAPDFNEEHLRRSLRTIIiAYSEEDTAMQMTPFPTQVEELLCNLNSILY 
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ATAQMKEHENDPEMLVDLQ YS LAKS YAS T PE LRKT WLDSMAR I HVKNGI LSEAAMCYVHV 
ATAQMKEHEKDPEMLIDLQYSLAKSYASTPELRKTWLDSMAKIHVKNGE FSEAAMCYVHV 
ATAQMKE HEKD PEMLVDLQ Y S LAN S YAS T PE LRRT W LE SMAK I HARNGE LSEAAMCYIHI 
DTVKMKEHQEDPEML I DLMYRIAKGYQTS PDLRLTWLQNMAGKHSERS1< HAEAAQCLVHS 
DTVKMREFQEDPEMLMDLMYRIAKSYQASPDLRLTWLQNMAEKHTKKKC rTEAAMCLVHA 
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TALVAEYI TRKEA — 
TALVAEYI TRKEAD- 
AALVAEFI HRKKL-- 

AALIAEYljKRKGYWKVEKIcjrASLLSEDTHPCDSNSLLTTPSGjGSMFSMGWPAFLSITPN 

— RKYLPVGCVT FQN ISSN 
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HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



-VH FNE DVLMELLEQCADGLWKAERYEL I AD I YKL 1 1 P I 
-VHFNEDVLMELLEQCADGLWKAERYELIADIYKLIIPI 
-VHFNEDVLMELLEQCADGLWKAERLRAGliLTSINSSSP 
-VHYSEEVLLELLEQCVNGLWKAERYE 1 1 SEI SKLIGP I 
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I DEEASMME DVGMQD- - 

I DEEASMME DVGMQD 

I DEEASMMEDVGMQD — 
I DEE GAMKEDAGMMD — 
I KEE GAAKEDS GMHD — 
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YEKRRDFERLAHI YDTI HRA rSKV rEVMHSGRRLLGI^^^AFFGQAAQYQFTDSETDVE 

SMKSGGTLETTHI YDTI HRP fSKV TEVITR -A- AGSWDLLPGGLFGQ 

YENRRE FENLT Q\ YRTI HGA fTKI LEVMHTKKRLLG ; " — TFFRVA FYGQ 

FEKQRDFKKLSDI YYDI HRS KLKV &EWNSEKRLFG PjYYRVjAFYGQ 

HEANRDAKKLST3 HGKL QEA FSKI VHQS TGWERMFG 1 YFRV GFYG- 

LEAHREFRKLTL1 HSKI QRA ETJSI VNKDH— KRMFG * qY FKV| G FFG — 



ITAM IT AM 
- FFEDEDGKSYIYKEPKLTPLSEI SQRLLKLVSDMFGSENVKMIQDSGKVNPKDLDSKSa 



GFFEDEDGKE YIYKEE KLTPLSEISQRLLKI 



YSDK FGSENVKMIQDSGKVNPKDLDSK YA 



GFFEDEDGKE YIYKEI KLTPLSEISQRLLKI YSDB FGSENVKMIQDSGKVNPKDLDSK FA 
SFFEEEDGKE YIYKEI KLTGLSEI SLRLVKI YGEF FGTENVKI I QDS DKVNAKELDPK SfA 
GFFEEEEGKE YIYKEF KLTGLS E I S QRLLKL YADF FGADNVKI IQDSNKVNPKDLDPK £A 
TKFGDLDEQE FVYKEI RITKLAEISHRLEGE YGEF FGEDWEVIKDSNPVDKCKLDPN SA 
S KFGDLDEQgFVYKE Pfel TKLPE I S HRLEAJ|YGQC|FGAE FVEVI KD S T PVDKTKLDPt^A 
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Coiled-Coil 1 



IHCFPYVKKRIPVMYQHHTD] NPIEVAIDEMSKKVAELRQLCSSAEVDMIKLG L.KLQGSV 
IHCFPYVKKRIPVMYQHHTD] ,NPIEVAIDEMSKKVAELRQLCSSAEVDMIKLC LKLQGSV 
IHCFPYVKKRIPVMYQHHTDI NP I EVAI DEMS KKVAE LHQLCS S AEVDM I KLQ LKLQGSV 
SNS FPYVKKRI P INCEQQIN] KP I DGATDE I KDKTAELQKLCS STDVDMI QLC LKLQGWV 
SHLFPYVKKRIQVISQSSTE] NP IEVAI DEMSRKVSELNQLCTMEEVDMI SLQ LKLQGSV 
SHAFP Y I KTRVNVTHKEE II] TPI EVAI EDMQKKTQELAFATHQDPADPKMLC 4VLQGSV 
MHAFP Y I KTRI S V I QKEE FV] TPIEVAIEDMKKKTLQIAVAINQEPPDAKMLQHVLQGSV 
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SVQVNAGPIAYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQAI AVNERLIKEDQLE 
SVQVNAGPLAYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQAI AVNERLIKEDQLE 
SVQVNAGPIJVYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQAI AVNERLIKEDQLE 
SVQVNAGPIAYARAFLNDSQASKYPPKKVSELKDMFRKFI QACS IAI ELNERLI KEDQVE 
SVKVNAGPMAYARAFLEETNAKKYPDNQVKLLKE I FRQFADACGQAI DVNERL I KEDQLE 
G TTVNQGPLEVAQVFLS E I PS DPKLFRHHNKLRLC FKDFTKRCE DAI RKNKS L I GPVQKE 
GATVNQGPLEVAQVFLAE I PADPKLYRHHNKLRLCFKEFIMRCGEA MEKNKRLI TADQRE 
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YQEEMKANYREMAKELSEIMHEjQICPLEEKTS-VLPNSLHIFNAISGTPTSTMVHGMTSS 

YQEEMKANYREMAKELSE IMHE QLG 

YQEEMKANYREIRKELSDI I VI RICPGEDKRATKFPAHLQRHQRDTNKHSGSRVDQFILS 
YHEGLKSNFRDMVKELSDI IHE QILQEDTMHSPWMSNTLHVFCAISGTSSDRGYGSPP^ 

YQEELRSHYKDMLSELSTVMNE QITGRDDLSK RGVDQTCTRVI SKATPALPTVS I SS 

YQRELG KLSS PZ ' : 

YQQELKKNYNKLKENLRPMIEF KI PELYKPI FRVESQKRDS FHRS S FRKCETQLSQGS Z- 
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PKLTGLSEISLRLVKLYGEKFGTENVKI IQDSDKVNAKELDPKYA ilQVTYVKPYFD DKE 
PAI TKLAE I S HRLEGF YGERFGED WEVI KD SNP VDKCKLD PNKA £ IQITYVEPYFD PYE 
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RTII TTSHI FPYV KKRIQVISQSSTELNjPIEVAIDEMbRKVSELN 
ggXI IAJH£ E£XV SKRIPVMYQHHTDLN PIEVAIDEM 5KKVAELH 



RTII TAIHC FPYV KKRI PVMYQHHTDLN PIEVAIDEM 3KKVAELR 
TAIHC FPYV KKRI PVMYQHHTDLN PIEVAIDEM 3KKVAELR 



RT I 



RTII 
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RTII TTSNS FPYV KKRIPINCEQQINLK PIDVATDEI SDKTAELQ 
KTII — " 
STVI 
RTSI 



XTSHA FPYI CTRVNVTHKEEI ILT PIEVAIEDM 2KKTQELA 
TTMHA FPYI ECTRI S VI QKEEFVLT PIEVAIEDM KKKTLQLA 
YLVQS LPGI 3RWFEVEKREWEMS PLENAIEVL 3NKNQQLK 
RTSF VTAYE LPGI LiRWFEVVHMSQTTI S PLENAIETM 3TANEKIL 
RTTI TLTHS LPGI 3RWFEVERRELVEVS PLENAIQW SNKNQELR 
RTSF VTAYK LPGI LRWFEWHMSQTTIS PLENAIETM 3TANEKIL 
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/AQVFLSE 
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M LQMVLQGSVGATVNQG PLE /AQVFLAE 
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L LSMCLNGVIDAAVNGG IAR fQEAFFDK DYINKHjP 
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E 3E 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT TAT CAT GAG ATT AAA ATA GAG TTG CCC ACT 
val leu his his his gin asn pro glu phe tyr asp glu ile lys ile glu leu pro thr 

bE IE 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
gin leu his glu lys his his leu leu leu thr phe phe his val ser cys asp asn ser . 

1EE 1SE 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
ser lys gly ser thr lys lys arg asp val val glu thr gin val gly tyr ser trp leu 

lfi£ E1E 

CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
pro leu leu lys asp gly arg val val thr ser glu gin his ile pro val ser ala asn 

SHE E7E 

CTT CCT TCG GGC TAT CTT GGC TAC CAA GAG CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu pro ser gly tyr leu gly tyr gin glu leu gly met gly arg his tyr gly pro glu 

30E 33E 

ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 

ile lys trp val asp gly gly lys pro leu leu lys ile ser thr his leu val ser thr 

■ ref 1-2 and 1-3 

3bE 1 31E 

GTG TAT ACT CAG GAT CAG CAT TTA CAT AAT TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 

val tyr thr gin asp gin his leu his asn phe phe gin tyr cys gin lys thr glu ser 

MES MSS 

GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
gly ala gin ala leu gly asn glu leu val lys tyr leu lys ser leu his ala met glu 

M6E S1E 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
gly his val met ile ala phe leu pro thr ile leu asn gin leu phe arg val leu thr 

SME S7E 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
arg ala thr gin glu glu val ala val asn val thr arg val ile ile his val val ala 

bQE b3E 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
gin cys his glu glu gly leu glu ser his leu arg ser tyr val lys tyr ala tyr lys 

bbE blE 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
ala glu pro tyr val ala ser glu tyr lys thr val his glu glu leu thr lys ser met 
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722 7SB 

ACC ACG ATT CTC A AG CCT TCT GCC GAT TTC CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
thr thr ile leu lys pro ser ala asp phe leu thr ser asn lys leu leu arg tyr ser 

762 ai2 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 
trp phe phe phe asp val leu ile lys ser met ala gin his leu ile glu asn ser lys 

2 I Cadherin Cleavage I 67S 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 
val lys leu leu arg asn gin arg phe pro ala ser tyr his his ala ala glu thr val 

102 ™ 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
val asn met leu met pro his ile thr gin lys phe gly asp asn pro glu ala ser lys 

1b2 ""2 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
asn ala asn his ser leu ala val phe ile lys arg cys phe thr phe met asp arg gly 

1DS2 1052 i 

TTT GTC TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 

phe val phe lys gin ile asn asn tyr ile ser cys phe ala pro gly asp pro lys thr 

1062 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
leu phe glu tyr lys phe glu phe leu arg val val cys asn his glu his tyr ile pro 

11M2 "72 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
leu asn leu pro met pro phe gly lys gly arg ile gin arg tyr gin asp leu gin leu 

1202 1232 

GAC TAC TCA TTA ACA GAT GAG TTC TGC. AGA AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asp tyr ser leu thr asp glu phe cys arg asn his phe leu val gly leu leu leu arg 

12b2 1212 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val gly thr ala leu gin glu phe arg glu val arg leu ile ala ile ser val leu 

■ ref 3-1 

1322 1352 1 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 
lys asn leu leu ile lys his ser phe asp asp arg tyr ala ser arg ser his gin ala 

1362 1M12/M71 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
arg ile ala thr leu tyr leu pro leu phe gly leu leu ile glu asn val gin arg ile 

1MM2 I 472 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn val arg asp val ser pro phe pro val asn ala gly met thr val lys asp glu ser 



CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG CAG A AG GGA AGC ACC CTG G AC A AC 

leu ala leu pro ala val asn pro leu val thr pro gin lys gly ser thr leu asp asn 

■ ref M--1 and M-2 
lSb2 1512 4 

AGC CTG CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 

ser leu his lys asp leu leu gly ala ile ser gly ile ala ser pro tyr thr thr ser 

lb22 ItSS 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 

thr pro asn ile asn ser val arg asn ala asp ser arg gly ser leu ile ser thr asp 

ref S-l and S-2 _ 

IbfiS 1712 1 

TCG GGT AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG AGC AAT TCC CTG GAT AAG^CAC CAA 

ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys his gin 

17MB 1772 

CAA AGT AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 
gin ser ser thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu ile 

lfl02 1A32 

AAG AGC CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
lys ser leu leu met cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 

lob2 ifl^a 

ACA TAT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 

thr tyr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 
| ref b-1 

1122 J ns2 

TGC CTG CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 

cys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 

naa boib 

GGA CCC ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 
gly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

2DH2 2072 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
met met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn his 

2102 2132 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
ser tyr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn ile ala 

ref 7-1 | 

BlbB 2112 | 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
thr glu val cys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 

2222 2BS2 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 
gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 

B2oB 2312 
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TGT TTT CTT CAA AAA CAT CAG TCT G A A ACG GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 
cys phe leu gin lys his gin ser glu thr ala leu lys asn val phe thr ala leu arg 

23 4 H E37E 

TCC TTA ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA GGG AGA GCG GAC ATG TGT GCG GCT 
ser leu ile tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

SHOE 243E 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

E4bE 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

ESSE "BE 

GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val arg thr his leu gin val ile ile ser val ser gin leu ile ala asp val val gly 

ESfiE 2bl2 

ATT GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC ACT GAC 
ile gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

EbME Sb72 

CGG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
arg leu ile lys his thr ser phe ser ser asp val lys asp leu thr lys arg ile arg 

E7DE 2732 

ACG GTG CTA ATG. GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
thr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 

E7b£ 2 "" 2 

GAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
asp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2fla2 EASE Ixxxxxxxxxxxxxxx Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

Transmembrane Domain xxxxxxxxxxxxxxxxxxxxxxxxxx I 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA GGC GTG TTT AGA CAA 
tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

ET4E 2<172 

GGA TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 

gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 
ref fi-l| 

300E 1 3D3E 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG . GAT GTG CTG ATG GAG CTC CTT GAG CAG 

asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 
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TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 
cys ala asp gly leu trp lys ala glu arg tyr glu leu ile ala asp ile tyr lys leu 



_ ref T.l 

3122 I 3152 

ATC ATC CCC ATT TAT GAG A AG CGG*AGG GAT TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
ile ile pro ile tyr glu lys arg arg asp phe phe glu asp glu asp gly lys glu tyr 

BlfiS 3212 

ATT TAC AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 

ile tyr lys glu pro lys leu thr pro leu ser glu ile ser gin arg leu leu lys leu 

ref 1D.1 - 

32M2 3272 I 

TAC TCG GAT AAA TTT GGT TCT GAA AAT GTC AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 

tyr ser asp lys phe gly ser glu asn val lys met ile gin asp ser gly lys val asn 

33Q2 3332 

CCT AAG GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
pro lys asp leu asp ser lys tyr ala tyr ile gin val thr his val ile pro phe phe 

33b2 3312 

GAC GAA AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
asp glu lys glu leu gin glu arg lys thr glu phe glu arg ser his asn ile arg arg 

3M22 3MS2 

TTC ATG TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
phe met phe glu met pro phe thr gin thr gly lys arg gin gly gly val glu glu gin 
■ ref 11-1 

34fi2 #• 3S12 

TGC AAA CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
cys lys arg arg thr ile leu thr ala ile his cys phe pro tyr val lys lys arg ile 

3SM2 3572 Ixxxxxxxx Coiled-coil 1 xxxxxx 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
pro val met tyr gin his his thr asp leu asn pro ile glu val ala ile asp glu met 

3bQ2 xxxxxxxx Colled coil 1 cont'd xxxx 3b32 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
AGT AAG AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
ser lys lys val ala glu leu arg gin leu cys ser ser ala glu val asp met ile lys 

I ref 12-1 

CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT GTT CAG GTC AAT GCT GGC CCA CTA GCA TAT 
leu gin leu lys leu gin gly ser val ser val gin val asn ala gly pro leu ala tyr 

3722 3752 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
ala arg ala phe leu asp asp thr asn thr lys arg tyr pro asp asn lys val lys leu 

37A2 3fil2 I xxxxxxxxxxxxxxxxxx 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
leu lys glu val phe arg gin phe val glu ala cys gly gin ala leu ala val asn glu 

3BM2 xxxxxxx Coiled coil 2 xxxxxxxxxx 3fi72 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
CGT CTG ATT AAA GAA GAC CAG CTC GAG TAT CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 
arg leu ile lys glu asp gin leu glu tyr gin glu glu met lys ala asn tyr arg glu 

3102 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 3132 xxx I 



ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG ATC TGC CCC CTG GAG GAG AAG ACG 
met ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

31b2 ana 

AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

14022 Ixxxx PHM xxxxxl 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

MQA2 4112 

TGG GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 
MIME 4172 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 
MHOE 4232 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 
M2b2 4212 

GCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 
M3S2 4352 

CTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 
■4362 4M12 

GTT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 
■ ref 13-1 

4M42 I M172 

CCT CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 
MSD2 4S32 

GGA GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 
45b2 4512 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 

-ref 14-1 

Hb22 MbS2 ■ 

GTC ATT AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT ATT ACA TTT CTA CAT TTT TAA TAC 
4bfi2 4712 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 
M745 477B 

ATA CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 

4B02 
TTT ACT 



BAC sequences of Human CLASP 2 
Refl.l 

Sequence of BAC4 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is underlined and represents nucleotides 356-375. 

TTTCTTACA GNGTT^ACTCAOGTATGTGCTCCTTCAACAAAATTAGCAGTTGCTGCTCTG 

TGACAAAGTTTGCAC CATTTTG CAAGAAGAAAAAAATCCTAATGTGTTATATTACTATA 

TTTTT A CTCT AT A G ATCTTTTTCT AAA G AAA G AAAGT A C AA CTG AA GTGCTT AT ATGT A 

TTCATATAAATGACTAGTACAAGCATCATTITGCAACAGATTTCCCCITTCATTGGAGG 

ATCTTCTTGATGTTATTTGTACACGATCAATTITTAGTCTTAATAAGATGAGGCTGGGTG 

TGGTGGCTCACACCTGTAATCCTAGCATTTTGGAGGCCAAGGTGGGCAGATCACTTTAG 

CCCAGGGGTTTGAGACCAGCCTGGCCAACATGGCAAAACCTTGTCTCTACAAAAATAC 

NAAAATTATCCAGGCATGGTGATGTGTGCCTGTAGTCCCAACTNCCTAGGAGGCTAGG 

GGTAGGGGGATTTGCAAGAGGCTGGGAGGGTCAAAGCCCNAANTGAGCCATTGGTNC 

ATGTCACHTGGACCCCAAGC^NNGGGGNGANCAAGAGCAAAGGACTN^^ 

AAAAAAAACCGGGCnAGCATACrWACCAACCCNOS^^ 

AAAANAAGGCTTTGNCTTGCANAGGAAAANCAAAATNNCC 

Refl.2 

Sequence of BAC26 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is underlined and represents nucleotides 351-375. 

TCTGGTTTfrrAGAOTGTATACTNAn rTTATrTTnrTrrTTNA AP A A A ATTAfiT AfiTTOrrn 

CTCTGTGACAAAGTTTGCACCATITTGCAAGAAGAAAAAAATCCTAATGTGTTATATTA 

CTATATTTTTACTCTATAGATCTTITrCTAAAGAAAGAAAGTACAACT 

ATGTATTCATATAAATGACTAGTACAAGCATCATTTTGCAAGAGATTTCCCCITTCATT 

GGAGGATCTTCITGATGTTATTTGTACACGATCAATITTTAGTCrTAATAAGATGAGGC 

TGGGTGTGGTGGCTCACACCTGTAATCCTAGGATTTTGGAGGCCAAGGTGGGCAGATC 

ACTTTAGCCCAGGGGTTrGAGACCAGCCTGGCCAACATGGCAAAACCTTGTCTCTACA 

AAAATACAAAAATTATCCAGGCATGGTGATGTGT.GCCTGTAGTCCCAGCTACCTAGGA 

GGCTAGGGTAGGGGGATTGCAAGAGGCTNGGAGGTCAAGGCCCGCAGTGAGCCATGG 

TCATGTCACTGCACCCCCAGCCAGGGCCGACAGGAGCAAGACTmTGTNTCAAAAAAA 

AACAGNAACCAACANCCAACAACAACAACNACCTTTC^GCAAAANAAGCITGCTNCA 

ANGAAACCAAAATGNCTTCTThTrTTTCCCCCN 

Ref 13 

Sequence of BAC26 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is not found within this sequence. This sequence most likely represent intron sequence 
since this sequence matches the intron sequence found in the previous two BAC sequences. 
AGNNNIWCCC^CTACNCCACIT^ 

GCrCCTTCACATATTAGGAGTTGCTGCTCTGTGACATAGTTGCACCATTNTGCAAGAAG 
AAAAAATCCTAAGTGTNATATCACTATATNNNTACTCTATAGAT 

AAAGTCAACTGATGTGCTrATATGTATOCATATAAATGACTAGTACATGCATCATTTTG 
CAACAGATOTCTCCTCACATTGGAGGATCTTCTNGANGNATrCGACACGATNANTATTA 
GTCn^AATAAGATGANGCTGGTGTGX3NGGTACACTGNATCTAGCATT^GGANGCATGT 
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GGCAGACACITANCCNCGGTNGAGACAGCTGTCACTGNCNAACTGTCTCrNTAAANCA 
AANNCTCCGCNGGNG ATGGGCTG AGCC AGTCCTAGNN GCT AGNT AGN G ATGNN G AGN 
TGTNGCACGNCGAGNGAGCATGNTCTGTACTGACTCATCAGGCGNCNACACGN TCTG T 
TCNAAAACATACCACACACACTCNCACCTNCGCAAAATTC 
TTCACACNGNTNCAATC^CTATATNNTCTTCTATTC^ 

CNCTGCANAAC^ATNCGNCCACCTKNANNACCTTANGCITNGTTTCACGCI^ 

CCCTACAC^mmCAGCNmTNCNNGTGAAGGGCCNCCCGAATCTACGANCATACT 

TCCGTATATNGCCTCGGTCANCGra 

TNCGCTATCTCTNNNCCGGATCC^CNCCATATNNTNNCTCTACTTANAG^ 
NCNCNCACTANTCACAACTTNTNCNTNNAACT 

TACTACCTmTCACTvrCANTCTCOTC^CTKrCCACTGATGTCCACATAGCT 
GCCANTTTATCATATNCACAC^CTCTACGCT^m^^^NT 

Ref 2.1 

Sequence of BAC4 using primer HC2S1, which spans nucleotides 1 107-1 126 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1079-1097. 

r-rTGTATTN A A AH AGGGTCT GCAGGAAGAAGTGTGTAGTCATAAATACCTCACTGGAT 

ATITTATACAGGATTCTAAAAAACCTATTAGCAATAGTATGCTAGAAATAGTCATTAGC 

TTCTTGACCTTCTTAGAACTGCACACTCTATTGCACTGTACAGATTTGAGGATGGCTGC 

AGGG ATTG ATTTG AAAA CT AAGG AC AC ATTTC AAT AAAC AATGTCTTC AATTG A 1 1 1 IT 

AGGGCTCCTCCTACITCAATGAAGGACTTCAGGTAGCTTATAATTACAGACACAGGCTC 

AAT AC AAT AAAAAAATT AGTAAGGC AG AGCITT AAAAAAAAAAAAGG AAAAAG AT AA 

TTCTACCAGAGAAAGG<JrACATG<}TGACTTCTGTTACCAGTAACAACCCCCGCACTACC 

TTTGGGTCTCCAGGAGCAAAACAGCTAATGTAGTTGTTGATCTGCTTGAAGACAAAGC 

CCCTGTCCATGAAGGTGAAACATCTCTGTGGAGGAAAACAAGCAAAuAAAGTTATTTCA 

GGTCCAAACATTTCGGAAATTTGGATTCAAAGCAGGCATTTATTG 

CACTGACATAAAAAACATGCCTTCAACATTGCCAGAGCACCTACTCTATTNTAGTCNCN 
Ref 3.1 

Sequence of BAC4 using primer C96AS, which spans nucleotides 1443-1452 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1370-1422. 

A ATC AGCA GLA PC! A A ACAG AGGCAGG T AO AflGGTGn(TrATC(TTTGK^CTGATGGCTCT GA 
AAAGAAGACACACATG<jTAAGTTTGACCCAGGATTCTGAGAACCGA ACTAA GTTGGTG 
CTGACCATCTCCnTTATTTGGATCCOT^ 

AATAGAGCAAAATCTTAGTGCTGTTACCATGAATTTTCTAACTGAT^ 
CACTTAAAATAAAGGACATTATCAATGCACAT^^ 

gaagcatatctctcatcaaaagaatgctt^^ 

GATCAGACGGACCTCCCGGAACrCCTGGAGGGCTGTCCCCACCTCCCTNAGTAACAGT 

cccaccaagaagtggtttctgcagaactcatctgttaatgagtagtcaagctgggagg 

TCTGAAATGAGGATAGAAACTACTTTGNGTTAGGAAAGATGCAATGCTCnTTTGAATA 

AAACAAACAAACCAAAC^AACAAAAAAAAAACTAAGACCCATCCTTNTGNATTTCA^ 

GCCCACCCTGGGGTNGGTCAAAGAGATGATCAGNANTTT^ 

AAATNAATTNTCCAGGGGNTGTTCTNCTTTTTAGCACANGGAGGG 

ACCAATTTAAATCCAATTNAGGNG 

Ref 4.1 



Sequence of BAC4 using primer C2AS5, which spans nucleotides 1716-1735 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1602-1703. 
TTCCTTTTCTGCAAGGCTGTTCCCGAATCTGTGCT 

TTT<TTCACA(^GTTGATGTTTGGAGTTGAGGTTGTATATGGAGAAGCT AAATGGAAATr. 

AAGCCAACAATAAAGTTTTATTAAGACAGAACAAAATAAAGATGAGTACTGAACTTTA 

AGGGAAATTGCITTTATTGCACTTATTTTTTCTGTTAGGAAGTTGGCT 

TCCATTACTTCACCTTTAAAGAACCAGGTCATATACAATGAGATAAAAAGAAACTAGT 

CTGAAACATTCAGATGTAAACATCAATTCACTTGTTAGAAACCACCTTTGATCGCTAAA 

GACTAAATGCATACCTGTTTCAGAATGTGATAGAATGAAGACTTAAAAAAATTAAAAG 

ATAAATCCACCTAGAACTATCAAATCACAAAATTAAACCACACAACAAACTTGTAGCA 

TTCAAACTGGTAATAAACACTGAGGAGCCTACCCAACTCTGAGGGGTGTCATGGGGTA 

TTTTAAATTTTCGAGGAGAACACAGTGATATGTGACCTCAGCCAGAAGCTGCTGTTTNA 

GCAGCAGGTTGGTGCTATGCTCCTTTTTGAAGACATATTTGTGAAGCTGGGTATTTTGG 

GGGGCCTGCTTATGATAAAANGGCAAGGTNTTCAATGNAGGGGN 

Ref4.2 

Sequence of BAC26 using primer C2AS5, which spans nucleotides 1716-1 735 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1602-1703. 

TTCCTTTCTGGAAGGCTGTTACCCGAATCTGTGCTTATGAGAGATCCTCTCGAATCAGC 

ATTTCTrCAr.AfTrGTrGATGTTTGGAGTTGAGGTTGTATATGGAGAAGCT AAATGGAAAT 

CAAGCCAACAATAAAGTTTTATTAAGACAGAACAAAATAAAGATGAGTACTGAACTTT 

AAGGGAAATTGCTTTTATTGCACITATTTTTTCTGTTAGGAAGTTGGCT 

ATTCCATTACTTCACCTTTAAAGAACCAGGTCATATACAATGAGATAAAAAGAAACTA 

GTCrGAAACATTCAGATGTAAACATCAATTCACTTGTTAGAAACCACCTTTGATCGCTA 

AAGACTAAATGCATACCTGTTTCAGAATGTGATAGAATGAAGACTTAAAAAAATTAAA 

AGATAAATCCACCrACAACTATCAAATCACAAAATTAAACCnSfCACAACAAA(^GTAG 

CATTCAAACTGGTAATAAAACACTGAGGAGCCTACCCAACTTTGAGGGGTGTeAATGG 

GGThTITTTTAAATTTTTCGNGGGANANCCCAGTG 

TTGTTTGTTTNACCAAGCNAGGTTGNNCTNTGCTCCrnTTTAGAANAC^ 
AAATNCTGGNTTTTTTNNGNGGCCCCCTNCNTTNNT 

RefS.l 

Sequence of BAC4 using primer C2S6, which spans nucleotides 1686-1705 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1724-1736. 

TTCrrrGGATAAG GTAATTGCTTTTACCCAACACAAATGTTTCITATA ATCAA T GGATTT 
AGCCCAAAGTAAACGTACITCATGTTCTAGTGCCTTTTAAGTGTGACuilllG , ri , l"l - ri'r 
CTAAACCACCCGGCTGACCTGGAGTAGGTGATGAGAGCTTTAAGGTTGGGGCCCATTC 
CrTGAAGTGCTCTGATTCCTGTTTCCAGTACCTCAGATCCTGGGCAGGGTTTGCAGTGG 
AGCGTCTTGAGTGAATGGCTCTGGTGGGTTGAACGGGGAGGGACTCAAAATGCTGCCC 
ATCTCAATTTCCTGTAGTCTTTTTATTTATTTA 

GTCGCCCAGGCTGGAGTACAGCGGCACGATCTCAATTNACTGCAACCTCCGCCTCC:TG 

GGTTCAAACGACTCCTCTGCCTCAGCCTCCCCAGCAGC:TGGGACCACAGGCACAAGCC 

ACCACCGCCCGGCTAATTTTTTGThnTTTTAGTA:GAGAT:GGGGTTTCACCATATTTG 

CAGGCTGGGCTCAAACTCCTGACC:TCGTCATCCGCNGCCTCGGNCTNCCA AAGTGCT T 

GGGATTNGAGGC^GTGAGCCCACTTACACCTNGGGCAATrCCCTGTNAGTCllllilAC 

CAGAGACACCATCATTCAACACAGCTTTTCCACCCACAA 



Fid*. lA«u**.) 



Ref5.2 

Sequence of BAC26 using primer C2S6, which spans nucleotides 1686-1705 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1712-1736. 

TflAGAAGAGCA ATTTCCTGG A T Ai^GT AATTGCTTTT ACCC AAC AC AAATG TTTCTT AT 
AATCAATGGATTTAGCCCAAAGTAAACGTACTTCATGTTCTAGTGCCTTTTAAGTGTGA 
CCTTTTG" 1"! TTTTI CTAAACC ACCCGGCTG ACCTGG AGT AGGTG ATG AG AGCTTT AAGG 
TTGGGGCCCATTCCTTGAAGTGCTCrGATTCCTGTTTCCAGTACCTCAGATCCTGGGCA 
GGGTTTGCAGTGGAGCGTCTTGAGTGAATGGCTCTGGTGGGTTGAACGGGGAGGGACT 
CAAAATGCTGCCCATCTCAATTTCCTGTAGTCTTTTTATTT^ 

AGAGTCTCGCTCTGTCGCCCAGGCTGGAGTACAGCGGCACGATCTCAATTCACTGCAA 

CCTCCGNCTCCCTGGGTTCAAACGACTCCTCTGNCTNAGNCTCCC:AGCAGCCTGGGAA 

CCACAGGCTCANGCCACCACGCCCGGCTAATTNTTGTAATTTTNAGTAANAAAT^ 

GGTTCTCACCAWITGGCCCAAGNCT 

NCNCCCCNACCNTGGGCNCTNCT 

CCCCNTATCACCGTGGNCCTTAATTT 



Ref6.1 

Sequence of BAC4 using primer C2S7, which spans nucleotides 1918-1937 of the cDNA. Exon 
sequence is not found within this sequence.Since the primer is directed against exon sequence we 
presume that sequence derived from C2S7 is intron sequence. 

NAGNGNGGGTTTNAGNCGTTTGAAGCCTGNNACGNGGTGNGTGGTNGAACTCTGTGGG 
CTTTCAGGTACTGGGGTATCTGGGAGCCTGCTGTTTGCATTGCTAGTGCATCAGACCAG 
GGCTTTTTCCTCCCTGTAGCTGCTACTTATACACATAGCTCTAACTGAGATGATTCTCCA 
GACAACTGATGCAGAGCAGCAAAAGCTTCTGCCGTTCTCCCCITCTAGGAGTGTCTCCT 
TTCITTGGAAAGAGATCATGAGGGGCTAGATTGTAATGAAGTGAGGCTCAGTGCTTGA 
GCACATCCGGTAAAAGTTCCAATATATTGGTCATAAAGTTTCTCATTCTTTATAGCAGT 
TAATTTCTCTGGCTCATGAGTTTTCITAGTT^ 

GCACCAGTCATATCCCCAGGGCAAACTCAAAGGCATGAGAGGCCAGACTCGGGTCCTG 

GTCATAGCAACCCCTGTCTAGGGCCTTGGTCCCTGCCTCCGCTTGTGTGCTGTGGCGCA 

GGTCCTATGGGCCCTTAGGAAACAGGACCACCCTGTCGCACCCCCTACAGAGACCAGC 

CAAGTITGACATTAGATCACCGTAGCAATGTNTGCAAATTC 

GGTTAAGCCTTGCAGCCACTTTATCTGTAACTGGCNGAGGTTTTGACATAAAA 

Ref7.1 

Sequence of BAG4 using primer C2S8, which spans nucleotides 2143-2162 of the cDNA. Exon 
sequence is underlined and represents nucleotides 2 1 82-22 19. 

(TCTCGA C A rGCTTGTTTCT ATT AA C ATTGGCGTTT A AG GTTTGTATC AATTTGCTGTTCG 
NGGTTCTAGTTTTACCTTTCACATTCATTCT 

CTATGTTGCAr i'l 1 1 1 1 ACTrCAGCAATTATTTTTGTCCCTGTAAGGAAACCATTAATCTTT 

AAATTCCTTTAATGAAATCATTCCACAGTGAATGGCTTGAATGCCCTGAAATAAAATTT 

AACTGGTCAGTGTGTGCTGCGCGCTTGGGTATGGTGGAAACACGGTCTCTGGAGGCAG 

TTAACTCTTGGCTCGAACCTTGAGGATGGTGAATATAGGCACCTAATCAGGCATTTCTG 

CCTTGAATATCTTTAAATATATCCAAATGTTATAGCGTTTAATTAGATTTTTATGTAGAA 

AGGAGCAATAAACACAAGACACATGTTITCAGTTTTTTATCTGTTA 



T AAAAACGTTTTGG AG AT AG AAAATG AAAGGGG ITITTTTTTI GTCTTGTTTT AAAGTT 
TTAGCAAATAATATTCAAGTAGGTGGAGATGGACrCTTCACCACTCTCCTGTTTTTAGG 
AACCCAATACTTTTTCATTCITGCTAAATGATTACTTCCATTTCTAGCA 
GAAAATTGGAATGAGTGTTTATAT 

Ref8.1 

Sequence of BAC4 using primer C2S9, which spans nucleotides 2992-301 1 of the cDNA. Exon 

sequence is not found within this sequence. Since the primer is directed against exon sequence we 

presume that sequence derived from C2S9 is intron sequence 

CGCnTNAAATNCCAGCCGCTACTGCGGGGCGNTNAATTCGAAACGTC 

GATGCCTGGCTCTGATTGTGTGGGATTGGTCATCAGTGGCGGTTGGCAGNTGGGGTTCA -' 

TGGAAGCGGCCATGGGGACTGATGGCAGGCCCTTGGATTGCCACCGCAGAGCCTGGCA 

GTGTCITTGGTCTGCATTCCTACCGGCGAAGTCTCATTTC^ 

GAAAGCATTCCTTTAGCGGGCTGTGTCTACCCTTCCATCCTCTCGTCCAAACT 

CTTCTCTGTTCTGTCTCCnrTCCCATCCTCriTCTCCCCAG 

CAGTGGTTTCTCTTCCTCTGTTTGACITTCCAAGGTCATTTTGACT 

CTACAAAGATACTAAAATCTCACCTAACCACTCTTCTTCTTTCTTAATGAAAGAATGTT 

TTCAGTCCATCCCAAATTTGTGTGGACTTCACAAACCTTCTCTAAAATGGAGCCTTTTCT 

CTTCCTACTCTTGACTAGNTGGTAAACGCTCCATGTTCTTGGCCAGAACTCCCTGGTGA 

GTAGCGTCACTCCCACTTTCCTGTGCAGAACCAAGCCTCCTAGAAAACTCCTTTGCANC 

TGAGTGGGTTGGGACACGCCCITTNTTTGGG 

Ref9.1 

Sequence of BAC4 using primer C2AS10, which spans nucleotides 3276-3295 of the cDNA. Exon 
sequence is underlined and represents nucleotides 3 147-3234. 

TTTANACrnvJATNTrATCCGNGTCAGTTANAGGAGT(TrCTGAGAAATTTCCGACAGGnGT 

GTGAGTTTGGGTTCCITGTAAATATACTCCrTTCCATCTTCATCTTCAAAGAATCCCT GT 

GACATAAAGCACAATTAGAGCTATCCCTGAACGTAAGCCCAGGGCTTACCACCTAGGA 

AGCGTTCTTTTATTACAAGGGGGAAAAAAAGGAATGGGTCTAAAAATCCAGCTGAAAT 

GGGCTTTCTGAATGAGAAAGAAAATGCTAATAACAfGAAGTCTAGGTGCAAAGGTAAA 

GGAAAAACACAACATTGCAAACITATTCAAGAATGCAGTCATTAAGTGTTGAGTGAAA 

TGAAAGATTTTGGATACAAGACTAAGCTGTCCCAGGGAAGTCTAATGGGAGTCAAGCC 

TGTTTCACTTTCCCAAGAAGCAGAACTCACTANAAAATGATGAGCAGCCCACGACAGG 

CAGGCTCAGAAGTGGACATGCCTCCCITCrrCCTGATGGCTNCCATGCACACAGGATTTT 

ATGGCATGAACTGAAGCGTTTGGGGGTCTGGAGTAAGTTTAGTAAAAGTTAGGTAAAG 

CTTGTATAAATTGTATTTTTGCITTACCCGATG 

GCTTCAATATTCAAGAAAAATATTTTTCATNTCACCCG 

ReflO.l 

Sequence of BAC4 using primer C2S 1 1 , which spans nucleotides 3 1 67-3 1 86 of the cDNA. Exon 
sequence is underlined and represents nucleotides 323 1-3296. 

NGN A 'Mfyrnn A GCCNCG ANCC A flflfl A C, A ATCTTN A A nCTTNCTTT A A ACTGTACTCGG ATN A 
ATTTGGTTfTTG A A AATGTC AAAATQ ATACAGGATTCTGGCAAGGTATT GACCAT GTTTG 
GANAAGTTTCATAGCAATGTAATGTTGTGATNCGATTACATATOATATATTTITAAATG 
Tm-ATAGAAAAAAACACANGAAAAATATTAAGGATTGTTGGCCCGTGAGTGGCAGGTG 
TATNTTCTTNCTGATCCITTAGNGCTTTCCATTACATGCNTGACAT^ 



TCGCCTAATTTTTGAAACATCTAATTTTACAAAATAATTAACCGTNTGGCCANGNATAT 
TOTCATTTTTAGGNCCAGCTATTTAGAAACrCTGACANAAATGAGGGGCTGTGGCrTNC 
CTNCCniWACTTGNCCCTCriTTChWGNATG 

TNACCGGGTGGCATGTTANAGGACAGGTTGAAACCNCAm^GGG(^GGANTTNGGTN 

NAATTGGGACACAATGGTACNANGCTCTATNGGAATNGAAACTCTCCCNACNNNCNGT 

GNNCCKTGGGGAAAATGNGNCNNATTCATTTTN 

Refill 

Sequence of BAC4 using primer G2S12, which spans nucleotides 3474-3493 of the cDNA. Exon 
sequence is not found within this sequence. Since the primer is directed against exon sequence we 
presume that sequence derived from C2S9 is intron sequence 

AGNANNGTTNNGCAGCTGCANNTCTGGACCCANAGGCCGCANGGGCACGAGCCNGGA 

CACGCTCGGCAAAGAGCTGTCCAGAGGGATTCAGAAGCTTCAGGACTGGAAGGGTCTT 

TCGAGCTCAGTTAGCCACCCCCAGACCCATTTCAGTTTCACATITATCTAGTGCTTCCTT 

TTGAATACTTGGGATGTTTTTCTGTTGATCTGTTGGCACTTCCTTCTTCCACAAGACCAG 

AAGCTCATATCCAATCTAAGGTCACTTACCCTTCTGAGAATCTGATGAAAATGGCGTGC 

CTTATGTGCCTAGATGCITTTGCACACAGTCTAAGGTGACTTATGGAGTCCAGGTCCAG 

CAGCCACACCCAGTCCTGGGTCTCCGCACAGGGAGGGACCCGTCTTCACACACCTGTCT 

CAGGTTCTAGCATTGGGCTGCTTCAGCGGTCTCAGGCTGTGAGTAAATGGGATGTGAG 

CrTGGATCGCCCCACGCrGTTGNCCCCCGGGGGGCTTGGCCAGCTGGCCACTTNGAAAT 

GCCTCCTTTTGCCCAGGAAAGCTCACTGCATTTCAATGGGGNTTNTCCACGAAGTTCAN 

CTTTANGGG 

Refl2.1 

Sequence of BAC4 using primer C2S1 3, which spans nucleotides 3645-3664 of the cDNA. Exon 
sequence is underlined and represents nucleotides 3683-3699. 

a ana angtnnctcantnaann cagcgtgagngttcaggt gagccaggcacagcaggc 

cggagggcagcaggggacgtccttgcccctgggtgacttgagagtcgtttccactaac 

aaggtctacttgagagccrcggtttaccaagtgatccctgctcccttcccccaacgtnt 

gtgacatttctcctgatatcagagggggaggaaacctcatgatccctgcccgccgcccc 

atgaggactgactgtggggacaaagAgccagatctcatagactaccctgatttgtcag 

tatttggggaattctgggtgcctgattagaagcatcaagactcttctaaatncaaaga 

agtgtggagagcagtagattttcctataaaactggtgttggtggtttctatgaaaattg 

gatccaaaaaaagtccitaagtttaccctcttaatggnatcttttgattaatggaattc 

attattttaatatagcccaatcaatccaatttttctttattggt^ 

tttaaaaaaatcttggnctacctccaaaatttcacagatgttctcctagggttttcctcc 

ttttggttcaagcatcccattcaangtcttgcagtccattctgggg 

Refl3.1 

Sequence of BAC4 using primer C2S14, which spans nucleotides 4289-4308 of the cDNA. Exon 
sequence is underlined and represents nucleotides 4321-4448. 

G A CTTT ANATTTATTCTTCCTTGC AG AGTA GTGTTAG A AT AG ATGGCCTACAGAAAA AA A 
AfiC/rTGTG GCTATCTACATG^^ 

TTTGCCTCGA GGCTGAGCTGGAAAATCrTGAAAATAl'llUUUUl'lUCCTGTGGCACATTC 
AGGTTGAATACAAGAACTATTTTTGTGACTATGTTTTTGATGACCTAAGGGAACTGACC 
ATTGTAATTTTTGTACCAr^rTGAACCANGAGATTTAAGTGCTTTTATA 



ATTTAAGAAAATATGAAAGCTTAAGGAATTATGTGAGCTTAAAACTAGTCAAGCANTT 
TAGAACCAAAGGCCTATNTTNATAACCGCAACTATGCTNAAAAGNACAAAGTAGTACA 
GNATATTGOTATGTACATATCATTTGGTAATACACNCCNGGCNTTCTGTACATATATGT 
ATTACATTTCTACNTTTTTAATACTGCCNTGGGCTTATGCCNT^ 
AAATTTNGGCTGTTCCN GTNTATNCN ATACN CTTTT 

Refl4.1 

Sequence of BAC4 using primer C2AS15, which spans nucleotides 4680-4700 of the cDNA. Exon 
sequence is underlined and represents nucleotides 4660-4683. 

ATfiAGAATGTAA TACATATATGTACAGAATGCCAGGACTGTATTAACAATGATATGTA 

CATAACAATATACTGTACTACITTGTACTTTTCAGCATAGTTGCGGTTATTAATATAGG 

CCTITGGTTCTAAACTGCTTGACTAGTTTTAAGCTCACATAATTCCTTAAGCTTTCATAT 

TTTCTTAAATGCAAGGAAATGAATATAAAAGCACTAAATGTCCTGGTTCACrGGTACAA 

AAATTACAATGGTCAGTTCCCTTAGGTCATCAAAAACTAGTCACAAAAATAGTTCTTGT 

ATTCAACCTGAATGTGCCACAGGAAAAAAAAAATATTTTCAAGATTTTCCAGCTCAGC 

CTCGAGGCAAAAGGCCCCCAGGCATCAATGTCAGNGCAGCCCTCCTGCCATGTAGATC 

CCAGAACCTTTTTTTTCTGTAGGCCATCTATTCTAACACTACTCTGCAGGGAGAATAAA 

ATCTAAAGNCCAGCTCAAGAGTGCrACCACACCTTTGTTAAGACACAATGAAAACTTT 

GGATATTGGGAGGNGAGATTTAAAAAAAAATGTGCCCnTTC TTAC CACTCCTATAGNA 

AAGTCTGGTTAAGAAATAACCGTTGGTCITTATTTTCCl 1 1 i NTTTCCCCTTCCCTTGGG 

NCTTCCTGGGGCTCGG 
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I PKESRNVN YSLAS FLKCCLTLMDRG FVFNLIN DYIS — GFSPKDPKVLAEYKFEFL 

ALEETRRATHSVARFLI^CFTFNDRGCVFKMVK NYIS — MFSSGDLKTLCQYKFDFL ~7 • I 

DTEMVERLNTSIAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVI^SLRLDFL 3.1 / 3-2. 
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RWCNHEHYIPLNLPM PFGKGRIQR- YQDLQL DYSLTDEF 

RWCNHEHYIPLNLPM PFGKGRIQR : YQDLQL DYSLTDEF 

QT I CNHEHY I PLNL PM- AFAKPKLQR VQDSNL EYSLSDEY 

QEVCQHEHFIPLCLPIRSANIPDPLTPSES TQELHASDMPEYSVTNEF 

RIICSHEHYVTLNLPCSLLTPPASPSPSVSSATiSQSSGFSTNVQDQKXANMFELS — VPF I4..I ( l+.T- 
-MNADTAPTSPCPSIS— SQNSSSCSSFQDQKIASMFDRTSRVPA 
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HC4 
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HC3 
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CRNHFLVGLLLREVGTALQEFRE — ■ — VRLIAISVLKNLLIKHSFDDRYASfIsHQARIAT 3- i 
CRNHFLVGLLLREVGTALQEFRE VRLIAISVLKNLLIKHSFDDRYASKSHQARIAT 

CKHHFLVGLLLRETSIALQDNYE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQ 

CRKHFLljSILLREVGFALQEDQD VRHLAIAVLKNLMAKHSFDDRYREPRKQAQ1AS f- I 

RQQHYLAGLVLTFJAVILDPDAEGLFGLHKKVINMVHNLLSSHDSOPRYSDPQIKARVAM 
SSTS-SPGLLFTELAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 
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LYLPLFGIilEKVQRIKVRDVSPFPVHAG-MTVKDESI^PAVNPLVTPQKGSTLDNSLH 
LYLPLFGLLIENVQRINVRDVSPI!PVNAG-MTVKI)ESUaPA^ 

LYLPFVGLLLEN IQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP — AN — RGSLS 

LYMPLYGMLLDNMPRI YLKDLYPFTVNTSNdGSRX)DLSTNG<5FQSQTAIKHANSVDTS FS <\ . I 

LYLPLIGI IMETVPQLYDFTETHNQRGRPICIATDDYESE SG SMIS 

LYLPLVGI ILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 

kdllgaisg^pyttstpninsvrnadsrgslistdsgnslpernseksnsldk}iqqss ( 5-V 

KDU^ISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 

tdkdtaygsfqng 

kdvlnsiAfss 

qtvamaiagtsvpq— 
qnvalaiagnnfn 



HGIKREDSRGSLI P-EGATGFPDQGNTGEN TRQS 

-IAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPL 10\ 
-LTRPGSFLLT^TSGRQHT— 3 - J 



-LKTSG— IVLSSjLPYKQYN 



1.1 



HC2A TLGNSVVRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

KIAA TLGN S WRCDKLDQSE I KSLLMCFLYI LKSMSDDALFTYWN-KASTS ELMDFFT I SEVCL 

rat • : 

HC4 STRSSVSQYNRLDQYEIRSIiMCYLYIVKMISEDTLLTYWN-KVS PQELINILILLEVCL 

HC1 ALIGSTLRFDRLDQAETRSLLMCFLHIMKTISYETLIAYHQ-RAPSPEVSDFFSIlbvCL 1 1. I ( 2_ 

HC3 TFSAESSRSLLI CLLWVLKN - ADETVLQKW FTDLS VLQLNRLL DLLYLCV 

HC5 MLNADTTRNLMICFLWIMKN-ADQSLIRKWIADLPSTQLNRILDLLFICV 



HC2A HQE|QYMGKRYIARNQEGLG — PIVHDRKS — QTLPVSRNRTGMM I 

KIAA HQFQYMGKRYIAR TGMM 

rat • 

HC4 FHFRYMGKRNIARVHDAWLSKHFGIDRKS QTMPALRNRSGVM 

HC1 QNFRYLGKRNIIRKTAAAF — KFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHK 

HC3 SCFEYKGKKVFERMNSLTFK-- KSKDMRAK LEEAILGSIGARQEMV 

HC5 LCFEYKGKQSSDKVSTQVLQ— KSRDVKAR LEEALLRGEGARGEHM 



HC2A HARLQQL GSLDNS LT FNHS YGHSDADVLHQSLLEANIATEVC . 

KIAA HARLQQL GSLDNS LTFNHS YGHSDADVLHQSLLEANIATEVC 

rat 

HC4 QARLQHL SSLESS FTLNHSSTTTEADI FHQALLEGNTATEVS 

HC1 QHRSQTLRI IRGK NALSNPKL LQMLDNTMllSNSNEI DIVHHVDTEANIATEGC [Z.i It 12. 

HC3 RRSRGQLpRSPSGSAFGSQENLRWRKE>MTHWRQNTEKLDKSRAEIEHEALIDGNLATEAN g.{ ( 6- "2- 

HC5 RRRAPGN DRFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALI SGNLATEAH 

HC2A LTALDTLSLFTLAFK^QLLADHGHNPLMKKVFDVYLCFIjQKHQSETALKNVFTALRSLIY 1 . I 

KIAA LTALDTLSLFTIAFKNQLLADHGHNPI24KICVFDVYU:FI^KHQSETALKNVFTALRSLIY 

rat KLSRGHSPLMKKVFDVYLCFLQKHQSEMALKNVFTALRSLIY 

hc4 ltvldtisfftqcfkthflnndghnplmkkvfdihiafll^ngqsevslkhvfaslrafis 

hci ltildlvslftqthqrqlqq^dcqnslmkrgfdtymlffqvnqsatalkhvfaslrlfvc| 13.1 

hc3 liildtleiwqtvs — wes— -kesilggvlkvllhsmacnqsavylqhcfatqralvs 

hc5 liildmqeniiqass — aldc — kdsllggvlrvlvnslncdqsttylthcfa|rlralia 1 . j 

hc2a kfpstfyegradmcaalcyeilkccnski^sirteasqllyflmrnnfdytgkksfvrth 

kiaa kfpstfyegradmcaalcyeilkccnsklss i rteasqllyflmrnn fdytgkksfvrth 

rat kfpstfyegradmcasixtyevljcccnsklssirteasqllyflmrnn fdytgkksfvrth 

hc4 kfpsaffkgrvnmcaafcyevlkcctskisstwjeasaijjyijjouwfeot 

HCI KFPS AFFQGPADLCGS FC YEVLKCCNHRSRS TQT EAS ALL YLFMRKN FEFNKQKS IVRSH 

HC3 I^PEl^FEEETEQC^IX^jaJ^CSSSIGTIRSHPSASLYLUmQNFEIGN — NFARVK ~\. f / 1-2- 

HCS KFG DLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFS PGATS-— NFARVK 



HC2A LQVIISVSQLIADWGIGETRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

KIAA LQVI I S VSQLIADWG I GGT RFQQSLS I INNCANS DRLIKHTSFSS DVKDLTKRI RTVLM 

rat LQVIISI^QLIADWGIGGTRFQQSLSIINNCANSDPJjIKHTSFSSDVKDLTKRIRTVLM 

HC4 LQI I IAVSQLIADVALSGG S RFQES LFI INN FAN S DRPMLARAFPAEVKDLTKRIRTVLM . 

HCI LQ^IKAVSQLIAD-AGIGGSRFQHSIJ^TNNFANGDKQMKN]3NFPAEVKDLTKRIRTVLM fl+.t / IU.ZI [g~J I 

HC3 MQVPMSLSSLVGTSQNFNEEFLRRSIJCTILTYAEEDLELRETTFPDQVQDLVFNLHMILS ' 

HC5 MQVTMSLASLVGRAPDFNEEHIJUlSLRTILAYSEEDTAMQMTPFPTQVIXLIXaiLNSILY 
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PUf.: 



Transmembrane 



ATA^KEHENDPEKLVDLQYSUUCSYASTPELRKTWLDSMARIHVia^GDLSE^AMCYVHV 
ATAQMKEHENDPEMLVDLQySLAKSYASTPELRKTWLDSMARIHVKNGD jSEAAMCYVHV 
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DTVKMKEHQEDP EML I DLMYRIAKG YQTS PDLRLTWLQNMAGKHSERSN 1AEAAQCLVHS 
DTVKMREPQEDPEMLMDLMYRIAKS YQAS PDLRLTWLQNMAEKHTKKKC fTEAAMCLVHA 
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TRKEAD 1 

HRKKL- 



LALQREPPVFPYSHTSCQRK 



iRGGMFRQGCTAFRVITPN 
FPNGCSAFKKITPN 



-RKYIiPVGCVTFCfclSSN 0. ( (9.1. 
-HSYLPVGSVSFQNISSN 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



I DE^^SMHE DVGpQD- : VH FNE DVLMELLEQCADGLWKAERYELIADI YKL 1 1 PI <? . { 

IDf^pllMEDVGWQD VHFNEDVLMELIXQC»DGLWKAF«iJr^IABIYKLIIPI 

IDEEJ^^MEPVGMQD- VHFNEDVLMFXLEQCADGLWKA^S&^GLLTSINSSSP 

I DEFJSAT4KEDAGMMD VHYSEEVLLELLEQC^VNGLWKAERYEIISEISKLIGPI 

I KEEGAAKEDSGMHD -TPYNliNILVEQLYMCGEFLWKSERYELIADVNKPIIAV | 1 ./ I 

VLEBSAVS DDVVS PDEEGI CSGKYFTESGLVGLLEQAAAS FSMAGMYEAVNEVYKVIjI PI 
VLEESWSEDTLS PDEDGVC^GQYFTKGLVGLLEQAAELFSTGGLYETVNEVYKLVI PI 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



•FERLAHL fDTL 1R? YSKV ?EVMHS( 
SMKSGGTLETTHL fDTL 1RI YSKV 'EVITR- 
YENRREFENLTQV ( RTL 1GJ YTKI ^EVMHTKKRLLG — 

FEKQRDFKKLSDL fYDI IR5 YLKV lEWNSEKRLFG 

HEfiiREAKKLSTi 1GKL JE? FSKX TOC^TGWERMFG 

LEAHREFRKLTLT 1SKL )PJ FDSI TOKDH — KRMFG 



SGRRLLG T^ FRN^ AFFGQAAQYQFT PS ET DVE 
-AGSWDLLPGGLFGQ 



— TFJEB3ZAFYGQ 

R fYRV VFYGQ 

T fFRV ;FYG- 

TfCEBYjSFFG- 



^-1 



HC2A -FFEDEDGlCEYIYKEPKLTPLSEISQRLLKLYSDKFGSENVKMIQDSGKfTNPKDLDSKYA 

KIAA GFFEDEDGKEYXYKEPKLTPLSEISQRLLKLYS DKFGSENVKMIQDSGKVNPKDLDSKYA 

rat GFFBDEi>GKEYI YKEPKLT PLSEI SQRLLKLYS DKFGSENVKMIQDSGKVNPKDLDSKFA 

HC4 SFEEEEDGKEYIYKEPKLTGLSEISLRLVKLYGEKFGTENVKI IQDSDKVNAKELDPKYA 

HC1 GFJ^iiEEQCEYI YKF.PKLTGLSEI SQRLIJUjYADKFGACirVKI IQDSNKVNPKDLDPKYA 

HC3 TKFGDLX^EFVYKEPAXTfOJ^ISH^ \.C-\ ( {O-Z. 

HC5 SKFGDLDEQEFVYKEPAITKLPEISHRLEAFYGQCFGAEFVEVIKDSTPVpi^rKLDPHKA 4.. | 

HC2A YlGV^ji^^P-FFBEKEIfflERKT^ II-/ / " . 2_ 

KIAA yiqV*Hh^f4^ekelqerktefershnirrfmfempftqt^ 

rat YIQVTHVTPFFDEKELQERKTEF1»CHNIRRFMFEMPFT(^ 

HC4 HIQVTYVKP¥nM3KELTCRKTEFEP^ 

HC1 YIQVTYVTPFFEEKEIEDROTDFEMHHNINRFVFETPFTI^GKKHG l#. | 

HC3 YIQriYVEPYFDTYEMKDRITYFDICNYinJIRFHYCT 

HC5 YIQI*FVEPYFDEYEMKDRVTYFEKNFIILPJIFMYCT^ 



fib. U6(tc«h) 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



T Coiled-Col^ jT 

I HCFPYVKKRI PVMYQHHTDLN fllEVAI DEMSKKVAELRQLCSSAEVDMIKLQLKLQGfeV 
IHCFPYVKKRIPVMYQHHTDLNE I EVAI DEMSKKVAELRQLCS SAEVDMI KLQLKLQG 3V 
I HCFPYVKKRI PVMYQHHTDLN I I EVAI DEMSKKVAELHQLCS SAEVDMI KLQLKLQG 3V 
SN S FPYVKKRI PINCEQQINLKI IDGATDEIKDKTAELQKLCSSTDVDMIQLQLKLQG iW 
SHLFPYVKKRIQVISQSSTELNI I EVAI DEMSRKVSELNQLCTMEEVDMI SLQLKLQG 5V 
SHAFPY I KTRVNVTHKEjEI ILTI IEVAIEDMQKKTQELAFATHQDPADPKMLQMVLQG 3V 
MHAFPYIKTRISVIQI^EFVLT HlEVAIEDMKKlCTLQIAVAINQEPPD 



Coiled-Coil 2 



SVC^AGPIAYARAFLDDTNTIWYPDNKVKLLKEVFRQFViACGQALAWERLIKEDQLE 
SVQvNAGPIAYAPAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQA jAVNERLIKEDQLE 
SVQVNAGPIJVYARAFLDDTNTKIIYPDNKVKLLKEVFRQFVEACGQA .AVNERLIKEDQLE 
S VQ VN AG PLA Y ARAFLN DS QAS KY P PKKVS ELK DM FRKF I Q AC S I A jELNERLI KEDQVE 
SVKVNAGPMAYARAFLEETNAKKYPDNQVKLLKEI frqfadacgqa ..DVNERLIKEDQLE 
GTTVNQGPLEVAQVFLSEI PSDPKLFRHHNKLRLCFKDFTKRCEDA -.RKNKSLIGPVQKE 
GATW<^PLEVAQVFIAEIPADPKLYRHHNKLR1X3FKEFIMRCGE^^ 



Colled-Coll 2 



YQFXMKANYREMAKELSEIMHEQI 
YQEEMKANYREMAKELSEIMHEQI 3- 



I PLEEKTS - VLFN SLHI FNAI SGT PT STMVHGMTS S 



Ref- = 



u- 1 



u-f / 12.1 



YQEEMKANYREI RKELS DITVPRlfc PGEDKRATKFPAHLQRHQRDTNKHSGSRVDQFILS 
YHEGLKSN FRDMVKELS DI IHEQI ^QEDTMHSPWMSKTLHVFCAISGTSSDRGgG^PRYA 

1 1 rGRDDLSK RGVDQTCTRVI SKAT PALPTVS I SS | «\ . | 



yqeelrshykdmlselstv^eq: 
yqrelg klss 

yqqeijotiynklkenlrpmierkljpelykpifrvesqkrdsfhrssfrkcetqlsqgsz 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



LPHE PHVGTC FVMCKLRTT FRANHWFCQAQEEAMGNGREKEPWTVI FN SRFYRS WGK 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 




genomic DNA BAC 6 DNA 




Fldi.S. 




none GST alone T1AM-1 PSD95 DLG1 neDLG 
GST-PDZ fusion protein 



o 
o 



0.35 r 
0.3 7 

0.25 | 
0.2 

0.15 
0.1 

0.05 
0 



□ 5 uM CLASP2 

M 5 uM CLASP2 + 1 00 uM CLASP2 Inhibitor 
■ 5 uM CLASP2 + 1 0OuM KV1.3 Inhibitor 



PSD95 



NeOLG 



PDZ Protein 



1 AATTGTAATA 

81 GAATTCGGCA 

161 ATGAAAAGCA 

241 GTCGTTGAAA 

321 GGTCTCGGCG 

4 01 GGGTAGATGG 

4B1 ATTTTTTCCA 

561 CATGCGATGG 

721 ACTTGAGCTC 

801 ACCAAATCCA 

881 CTT7GATGTA 

961 CTGCATCCTA 

1041 GAGGCATCTA 

L CAAGCAGATC 

I GACCTCCAGC 
L GACAGCCCTC 
L ATGACAGATA 
L GTCCAGCGGA 
I ACCAGCTGTG 
L TCTCCGGCAT 
L ATAAGCACAG 
I CACATTGGGA 
t TCTTAAAGAG 
L ATATCTGAAG 
I AGTTCATGAT 
I GCAGCCTGGA 
L GCCAACATTG 
L GGCCGACCAT 
I CGGCTTTAAA 



2S61 
2641 
2721 
2801 
2881 
2961 
3041 
3121 
3201 
3281 
3361 
3441 
3S21 
3601 
3681 
3761 
3841 
3 921 



. ATGTG" 
GCTCTACTTC 
CTGTCAGCCA 
GCCAACAGtG 
AATGGCCACC 
ATGCCAGCAC 
GCAGCAATGT 
CGCCTTCAGG 
ACGAGCATGT 
ATCTACAAAC 



I 20 
CGACTCACTA 
CGAGTTTTAC 
CCACCTGTTG 
CCCAAGTTGG 
AACCTTCCTT 
AGGCAAGCCA 
GTACTGTCAG 
AAGGCCACGT 
GTCGCGGTTA 
ATATGTTAAG 
TGACCACGAT 
CTGATCAAAT 
TCATCATGCA 
AGAACGCGAA 
AACAACTACA 
CAACCATGAA 
TTGACTACTC 
CAGGAGTTCC 
TGCTTCAAGG 
TCAATGTGAG 
AATCCGCTGG 
TGCTTCTCCA 
ATTCGGGTAA 
AATTCCGTGG 
CATGTCTGAT 
TCTGCCTGCA 
CGAAAGTCTC 
TAACTCTCTC 
CTACTGAGGT 
GGACATAATC 
AAATCTCTTC 



TCAAAATGAT 



TGAGATGCCA 
TACACTGCTT 
ATTGACGAGA 
CAAACTCCAG 
CAAAGCGATA 
GCGGTAAACG 
GGAGCTTTCT 
TCAACGCCAT 
ATGGCCCGTG 
CAGGGAGGAC 
AGAAGGTGCA 
TCTTGAGCTG 
ATCTACATGG 
AATCAGGGTA 
ATAGAGCAAG 
TCATCGACTC 
TGTGATAAAT 
TGTATACAAG 
I 10 



CTC 

CTGATGAGGA 
GCTGATAGCA 
ACCGGCTTAT 
GCCCAGATGA 
GCCCGAGCTC 
GCTATGTCCA 
GTCATTACCC 
GCTGATGGAG 
TTATCATCCC 
CTCACACCGC 
ACAGGATTCT 
TTGACGAAAA 
TTTAOGCAGA 
CCCTTATGTG 
TGAGTAAGAA 
GGCAGCGTGA 
TCCTGACAAT 
AACGTCTGAT 
GAAATCATGC 
CAGTGGGACT 
TGTGGGGACT 
CAAGGGGAAG 
CATATTTTTT 
GACTTAGATT 
CAGGGAGGGC 
CAGAACTTAC 
AATAGTGAGC 
CGGGACGGTC 
TTGTGCT GC T 



I 30 | 40 

TAGGGCGAAT TGGGTACCGG 
ACCATCACCA AAACCCAGAA 
CTCACATTCT TCCATGTCAG 
CTACTCCTGG CTTCCCCTCC 
CGGGCTATCT TGGCTACCAA 
CTGCTGAAAA TTTCCACTCA 
AAAACCGAAT CTCGAGCCCA 
GATGATCGCC TTCTTGCCCA 
ACGTGACTCG GGTCATTATT 
TACGCGTATA AGGCTGAGCC 
TCTCAAGCCT TCTGCCGATT 
CTATGGCTCA GCATTTGATA 
GCGGAAACCG TTCTAAATAT 
TCATAGCCTT GCTGTCTTCA 
TTAGCTGTTT TGCTCCTGGA 
CATTATATTC CCTTGAACTT 
ATTAACAGAT GAGTTCTGCA 
GGGAGGTCCG TCTGATCGCC 
AGCCATCAGG CAAGGATAGC 
GGATGTGTCA CCCTTCCCTG 
TGACGCCGCA GAAGGGAAGC 
TATACAACCT CAACTCCAAA 
CAGCCTTCCA GAAAGGAATA 
TTCGCTGTGA TAAACTTGAC 
GATGCTTTGT TTACATATTG 
CCAGTTCCAG TACATGGGGA 
A GACAT TGCC TGTTTCCCGT 
ACTTTTAACC ACAGCTATGG 
TTGCCTGACA GCTCTGGACA 
CTCTCATGAA AAAAGTTTTT 
ACTGCCTTAA GGTCCTTAAT 
CGAG ATTCT C AAGTGCTGTA 
ACAACTTTGA TTACACTGGA 
GACCTTGTTG GCAV1GGGGA 
TAAGCACACC AGCTTCTCCT 
AGGAGCATGA GAACGACCCA 
AGGAAGACGT GGCTCGACAG 
CGTAACAGCC CTAGTGGCAG 
CAAACAT CGA CGAGGAGGCC 
CTCCTTGAGC AGTGCGCAGA 
CATTTATGAG AAGCGGAGGG 
TGTCGGAAAT TTCTCAGAGA 
GGC AAGGT CA ACCCTAAGGA 
AGAGTTGCAA GAAAGGAAAA 
CCGGGAAGAG GCAGGGCGGG 
AAGAAGCGCA TCCCTGTCAT 
GGTGGCGGAG CTCCGGCAGC 
GTGTTCAGGT CAATGCTGGC 
AAAGTGAAGC TGCTTAAGGA 
TAAAGAAGAC CAGCTCGAGT 
ATGAGCAGAT CTQCCCCCTG 
CC AACAAGC A CAATGGTTCA 
TGCTTTGTCA TTTGCAAACT 
GGGAGAGAAA GGAAATAAAG 
TAAATCTCAC TGGCAATATT 
TTATTCTTCC TTGCAGAGTA 
TG CACTGA CA TTGATGCCTG 
TAGTTTTGTC TAGGAGTATG 
TAACTGAGCT AGACACTCAA 
ATATATGTAT TACATTTCTA 
CCAGTATATG CAATACACTT 



I 50 
GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGGTTTCT 



I 



60 



CATGTGGTTG 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TGAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTGC 
CATGGCCAGG 
AATATCTCAC 
TCCATGATGG 
TGGACTCTGG 
ATTTCTTTGA 
CTCCTTAAAC 
TCTGGATTCT 
CAGAGTTTGA 
GTGGAAGAGC 
GTACCAGCAC 
TGTGCTCCTC 
CCACTAGCAT 
AGTTTTCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATGACC 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTTC 
GTGTTAGAAT 
GGGGACCTTT 
TTGTATGACT 
TTAATCCGCT 
CATTTTTAAT 
TAATGGTTTA 



GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCTCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 



I 



CTGCCTC 
CATGACCGTG 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATCTG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 

CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTCGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTG 
ATGCGCGAGC 
CAATTTGTGG 
AATGAAAGCC 
CGAGCGTCTT 

TCCAAAGCCA 
TTTCTTAACA 
ATTGTGTCTT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 
TTCTTGTCAT 



70 

GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCCG 
TCAGGATCAG 
TAAAGTACCT 
CGAGTCCTCA 
TGAGGAAGGA 
AGACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTGTTACTGA 
GCTGATAAAG 
TTGGTCTGCT 
AAGGATGAAT 
CAAGGACCTG 
CTGATTCGAG 
GATAAGCACC 
ACTGATGTGT 
AACTTATGGA 
CAGGAGGGGT 
TGCCAGATTG 
TGCACCAGTC 
GCGTTTAAGA 
TCAAAAACAT 
TCTATGAAGG 
AGGACGGAGG 
ACATTTGCAA 
TGTCCATCAT 
AAAAGGATAC 
GTACAGCCTG 
AAAATGGCGA 
GTGTTTAGAC 
GATGCAGGAT 
GCTACGAGCT 
GGAAAGGAGT 
TAAATTTGGT 
ACATCCAGGT 
AACATCCGCC 
GCGCACCATC 
TGAACCCCAT 
GACATGATCA 
TTTCTTAGAT 
AAGCTTGCGG 
AACTACAGQG 
ACCGAATTCC 



ATCACTGGGG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GCTTATGCAT 
AAAAATGTGC 



I 30 



I 40 



I 



60 



I 80 
GCTTGATATC I 
ACTCAGCTGC 2 
GAAGAGGGAT J 
AGCACATCCC 3 
GAAATTAAAT 4 
CATTTACATA 4 
TAAGAGTCTG « 
CCAGAGCCAC « 
TTGGAGAGCC Z 
TGAAGAACTG I 
CATGGTTTTT f 
CAGAGATTTC « 
AGATAATCCA 1 

GcrrroTcrr j 

GAATTTCTCC 
AAGATACCAA 
GGGAGGTGGG 
CATTCTTTTG 
GATTGAAAAC 
CCCTGGCTCT 
CTGGGCGCCA 
AGGATCTCTC 
AACAAAGTAG 
TTCCTCTACA 
TTTTTTTACA 
TGGGACCCAT 
CAGCAGCTGG 
ATTACTTGAA 
ACCAGCTCCT 
CAGTCTGAAA 
GAGAGCGGAC 
CCTCCCAGCT 
GTCATCATAT 
CAACAACTGT 
GCACGGTGCT 
GCCAAATCCT 
TCTCTCAGAG 
AAGGATGCAC 
GTCCATTTCA 
CATCGCCGAC 
ATATTTACAA 
TCTGAAAATG 
GACTCACGTC 
GCTTCATGTT 
CTGACAGCCA 
CGAGGTCGCC 
AACTGCAGCT 
GATACAAACA 
TCAAGCCTTA 
AAATGGCGAA 
CTTCACATCT 
ATTACATCTC 
AGACCGAGCA 
AGGAGTTGTA 
TGGTAGACAC 
AGGTTCTGGG 
ATCTGATCGT 
ATTCAACAAC 
riGTCATTAA 
TAAGTTTAAT 
AATATGGAGA 

80 



I 



1280 
136.0 
1440 
1S2 0 
1600 
1680 
1760 
18.40 
1920 
2000 
2080 
2160 
2240 
2320 
2400 
2480 
2S60 
2640 
2720 
2800 
2880 
2960 
3040 
3120 
3200 
3280 
3360 
3440 
3520 
3600 - 
3680 
3760 
3840 



4080 
4160 
4240 
4320 
4400 
4480 
4560 
4 64 0 
4720 
4800 
880 
S98 



f IDA. 



I 10 I 20 I 30 | 40 

1 MEGHVMIAFL. PTILNQLFRV LTRATQEEVA VKVTRVTIHV 
81 SMTTILKPSA DFLTSNKXXR YSWFFFDVLI XSMAQHLIEN 
161 SKNANHS1AV FIKRCFTFMD RGFVFXQINN YISCFAPGDP 
241 QLDYSLTDEF CRNHFLVGLL LREVGTALQE FREVRLIAIS 
321 RINVRDVSPF PVKAGMTVKD ESLALPAVNP LVTPQKGSTL 
4 01 TDSGNSLPER NSEKSNSLDK HQQSSTLCNS WRCDKLDQS 
4 81 EVCLHQFOYM GKRYIARNQE GLGPIVHDRK SQTLPVSRNR 
561 IATEVCLTA1, DTLSLFTLAF KNQLLADHGH KPLHKXVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK 
721 SDRXiIKHTSF SSDVKDLTJCR IRTVLMATAQ MXEHENDPEM 
801 MCYVHVTALV AEYLTRKGVF RQGCTAFRVI TPNIDEEASM 
B81 KLIIPIYEKR PJDFFEDEDGK EYIYKEPKLT PLSEISQRLL 
961 FFDEKEMJER KTEFERSHNI RRFMFEMPFT QTGKRQGGVE 
1041 EMSXXVAELR QLCSSAEVDM IKLQLXLQGS VSVQVHAGPL 
1121 NERLIKEDQL EYQEEMKANY REMAKELSEI MHEQICPLEE 
| 10 | 20 I 30 | 40 



| 50 | 60 | 70 | 80 

VAQCKEEGLE SHLRSYVKYA YKAEPYVASE YTTVHEELTX 8 0 
SKVKLLRNQR FPASYHHAAE TWNMLMPHI TQKFGDNPEA 160 
KTLFEYXFEF LRWCNMEHY IPLNLPMPFG KGRIQRYQDL 24 0 
VLKNLLIKHS FDDRYASRSH QARIATLYLP LFGLLIENVQ 3 20 
DNSLHKDLLG AISGIASPYT TSTPNINSVR NADSRGSLIS 4 00 
EIKSLLMCFL YXLKSKSDDA LFTYWNKAST SELMDFFTIS 4 80 
TGMMHARLQQ LGSLDNSLTF NHSYGHSDAD VLHOSLLEAN 560 
YLCFLQKHQS ETALKNVFTA LRSLIYXFPS TFYEGRADMC £40 
SFVRTHLQVT ISVSQL.IADV VGIGETRFQQ S LSI INN CAN 720 
LVDLQYSLAK SYASTPELRK TWLDSMARIH VKNGDUSEAA 800 
MEDVGMQDVH FNEDVLMELL EQCADGUfKA ERYELIADIY 880 
KLYSDKFGSE NVKMIODSGK VNPKDLDSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RI PVMYQKHT DLKPIEVAID 104 0 
AYARAFLDDT NTKRYPDNKV KIXKEVFRQF VEACGQAXAV 1120 
KTSVLPNSLH 1FNAISGTPT STMVHGMTSS SSW 1194 

I SO I 60 I 70 I 80 



20 



1 AATTGTAATA 
81 GAATTCGGCA 
161 ATGAAAAGCA 
241 GTCGTTGAAA 
321 GGTCTCGGCG 
4 01 GGCTAGATGG 
4 81 ATTTTTTCCA 
561 CATGCGATGG 
641 ACAGGAAGAA 
721 ACTTGAGGTC 
8 01 ACCAAATCCA 
881 CTTTGATGTA 
961 CTGCATCCTA 
1041 GAGGCATCTA 
1121 CAAGCAGATC 
1201 GTGTAGTGTG 
1281 GACCTCCAGC 

13 61 GACAGCCCTC 

14 41 ATGACAGATA 
1S21 GTCCAGCGGA 
1601 ACCAGCTGTG 
1681 TCTCCGGCAT 
17 61 ATAAGCACAG 
1S41 CACATTGGGA 
1921 TCTTAAAGAG 
2001 ATATCTGAAG 
2081 AGTTCATGAT 
2161 GCAGCCTGGA 
2241 GCCAACATTO 
2321 GGCCGACCAT 
2401 CQGCTTTAAA 
24 81 ATGTGTGCGG 
2561 GCTCTACTTC 
2641 CTGTCAGCCA 
2721 GCCAACAGTG 
2801 AATGGCCACC 
2881 ATGCCAGCAC 
2961 GCAGCAATGT 
3041 OGCCTTCAGG 
3121 ACGAGGATGT 
3201 ATCTACAAAC 
3281 GGAACCCAAA 
3361 TCAAAATGAT 
3441 ATCCCCTTCT 
3S21 TGAGATGCCA 
3601 TACACTGCTT 
3681 ATTGACGAGA 
3761 CAAACTCCAG 
3841 CAAAGOGATA 
3921 GCGGTAAACG 
4001 GGAG CTTTCT 
4081 TCAACGCCAT 
4161 ATGGCCCGTG 
4241 CAGGGAGGAC 
4321 AGAAGGTGCA 
4401 TCTTGAGCTG 
4481 ATCTACATQG 
4 561 AATCAGGGTA 
4641 ATAGAGCAAG 
4721 TCATCGACTC 
4 801 TGTGATAAAT 
4881 TGTATACAAG 



I 



1 I JO I 40 

CGA CTCAC TA TAGGGCGAAT TGGGTACCGG 
CGAGTTTTAC ACCATCACCA AAACCCAGAA 
CCACCTGTTG CTCACATTCT TCCATGTCAG 
CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
AACCTTCCTT CGGGCTATCT TGGCTACCAA 
AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
AAGGCCACGT GATCATCGCC TTCTTGCCCA 
GTCGCGGTTA ACGTGACTCG GGTCATTATT 
ATATGTTAAG TACGCGTATA AGGCTGAGCC 
TGACCACGAT TCTCAAGCCT TCTGCCGATT 
CTGATCAAAT CTATGGCTCA GCATTTGATA 
TCATCATGCA GCGGAAACCG TTGTAAATAT 
AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
AACAACTACA TTAGCTGTTT TGCTCCTGGA 
CAACCATGAA CATTATATTC CGTTGAACTT 
TTGACTACTC ATTAACAGAT GAGTTCTGCA 
CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
TGCTTCTCCA TATACAACCT CAACTCCAAA 
ATTCGGCTAA CAGCCTTCCA GAAAGSGAATA 
AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
CATGTCTGAT GATGCTTTGT TTACATATTG 
TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
TAACTCTCTC ACTTTTAACC ACAGCTATGG 
CTACTGAQGT TTGCCTGACA GCTCTGGACA 
GG ACATAATC CTCTCATGAA AAAAGTTTTT 
AAATG1V1TC ACTGCCTTAA GGTCCTTAAT 
CTCTGTCnTA CGAGATTCTC AAGTGCTCTA 
CTGATGAGGA ACAACTTTGA TTACACTGGA 
GCTGATAGCA GACGTTGTTC GCATTGGGGA 
ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
GCCCAGATGA AGGAGCATGA GAACGACCCA 
GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
GCTATGTCCA CGTAACAGCC CTAGTX3GCAG 
GTCATTACCC CAAACATCGA CGAGGAGGCC 
GCTGATOGAG CTCCTTGAGC AGTGCGCAGA 
TTATCATCCC CATTTATGAG AAGCGGAGGG 
CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
ACAGCATTCT GGCAAGGTCA ACCCTAAGGA 
TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
CCCTTATGTG AAGAAGCGCA TCCCTCTCAT 
TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
GGCAGOGTGA GTGTTCAGGT CAATGCTGGC 
TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
CAGTGQGACT CCAACAAGCA CAATCGTTCA 
TGTGGGGACT TGCITTCTCA TTTGCAAACT 
CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
CATATTTTTT TAAATCTCAC TGGCAATATT 
GACTTAGATT TTATTCTTCC TTGCAGAGTA 
CAGGGAGGGC TGCACTGACA TTGATGCCTG 
CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
AATAGTCAGC TAACTGAGCT AGACACTCAA 
CGGGACGGTC ATATATGTAT TACATTTCTA 
T TCTQ CTGGT CCAGTATATG CAATACACTT 
TCTTTACT 

• 20 I 30 | 40 



I 50 

Gceeecccrc 

TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGGTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTG 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TGAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTJT 
ACTCCAAGCT 
AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTGG 
CATGGCCAGG 
AATATCTCAC 
TCCATGATGG 
TGGACTCTGG 
ATTTCTTTGA 
CTCCT TAAAC 
TCTGGATTCT 
CAGAGTTTGA 
GTGGAAGAGC 
GTACCAGCAC 
TGTGCTCCTC 
C CACTA GCAT 
AGTTTTCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATGACC 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTTC 
GTGTTAGAAT 
GGGGACCTTT 
TTGTATGACT 
TTAATCCGCT 
CATTTTTAAT 
TAATGGTTTA 

I 50 



| 60 
GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCTCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 
CTGCCTCTGT 
CATGACCGTG 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATCTG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 
■TGT G l MTn.T 
CCCTCAACAT 
CAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTCQGA 
AAGTATGCAT 
6AGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTG 
ATGCGCGAGC 
CAATTTGTGG 
AATGAAAGCC 
CGAGCGTCTT 
AGCTCGTCTT 
TCCAAAGCCA 
TTTCTTAACA 
ATTGTGTCTT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACAT GG 
TTCTTGTCAT 

60 



I 



70 



GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCCG 
TCAGGATCAG 
TAAAGTACCT 
CGAGTCCTCA 
TGAGGAAGGA 
AGACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTGTTACTGA 
GCTGATAAAG 
TTGGTCTGCT 
AAGGATGAAT 
CAAGGACCTG 
CTGATTCX1AG 
GATAAGCACC 
ACTGATGTGT 
AACTTATGGA 
CAGGAGGGGT 
TGCCAGATTG 
TGCACCAGTC 
GCGTITAAGA 
TCAAAAACAT 
TCTATGAAGG 
AGGACGGAGG 
ACATTTGCAA 
TGTCCATCAT 
AAAAGGATAC 
GTACAGCCTG 
AAAATG GCGA 
GTGTTTAGAC 
GATGCAGOAT 
GCTAOGAGCT 
GGAAAGGAGT 

TAA A 1 rrocT 

ACATCCAGGT 
AACATCCGCC 
GCGCACCATC 
TGAACCCCAT 
GACATGAXCA 
TTTCTTAGAT 
AAGCTTGCQG 
AACTACAQOG 
ACOGAATTCC 
CQGTCGTGTG 
AT CACT GGOG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GCTTATOCAT 
AAAAA.TGTGC 



I 



80 



GCTTGATATC 
ACTCAGCTGC 
GAAGAGGGAT 
AGCACATCCC 
GAAATTAAAT 
CATTTACATA 
TAAGAGTCTG 
CCAGAGCCAC 
TTGGAGAGCC 
TGAAGAACTC 
CATGGTTTTT 
CAGAGATTTC 
AGATAATCCA 
GC1TTGTCTT 
GAATTTCTCC 
AAGATACCAA 
GGGAGGTGGG 
CA'J 1 Li 1 1'lXi 
GATTGAAAAC 
CCCTTSGCTCT 
CTGGGCGCCA 
AGGATCTCTC 
AACAAAGTAG 
TTCCTCTACA 
TTTTTTTACA 
TGGGACCCAT 
CAGCAGCTGG 
ATTACTTGAA 
ACCAGCTCCT 
CAGTCTGAAA 
GAGAGCGGAC 
CCTCCCAGCT 
GTCATCATAT 
CAACAACTGT 
GCACOGTGCT 
GCCAAATCCT 
TCTCTCAGAG 
AAGGATGCAC 
GTCCATTTCA 
GATCGCOGAC 
ATATTTACAA 
TCTGAAAATG 
GACTCACGTC 
GCTTCATGTT 
CTGACAGCCA 
CGAGGTOGCC 
AACTGCAGCT 
GATACAAACA 
TCAAGCCTTA 
AAATGGCGAA 
CTTCACATCT 
ATTACATCTC 
AGACWSAGCA 
AGGAGTTGTA 
TSGTAGACAC 
AGGTTCTQGG 
ATCTGATCGT 
ATTCAACAAC 
TTGTCATTAA 
TAAGTTTAAT 
AATATGGAGA 



I 



70 



I 



80 



160 
24 0 
320 



560 
640 
720 
BOO 



1200 
1260 
13 60 
1440 
1520 
1600 
1660 
1760 
1840 
1920 
2000 
2080 
2160 
2240 
2320 
24O0 
2480 
2560 
2640 
2720 
2600 
2680 
2960 
3040 
3120' 
3200 
3280 
3360 
3440 
3S20 
3600 
3680 
3760 
3840 
3920 
4 000 
4080 
4160 
4240 
4320 
4400 
4480 
4560 
4640 
4720 
4600 
4660 
4898 



I 



20 



1 KEGHVMIAFL 
81 SMTTH-XPSA 
161 SKHANHSIAV 
241 QLDYSL.TDEF 

3 21 RIHVKDVSPF 

4 01 TDSGNSLPER 
4 81 EVCUHQFQYM 
561 XATEVCLTAL 
641 AAUCYEI1>KC 
721 SDRLIKKTSF 
801 MCYVHVTALV 
881 KLUPrSfEKR 
961 FFDEKELQER 

10 41 EMSKKVAELR 
1121 KERLXKEDQL 
1 10 



pXimQLFRV 
PFLTS NIOXR 
FIKRCFTFMD 
CRNHKLVGLL 
pVNAGMTWD 
NSEKSNSLDK 
GKRYXRKNQE 
DTI^SUTIAF 
CNSKLSS IRT 
ESDVKELTKR 
aEYLTRXGVF 
RDFFEDEDGK 
KTEFXRSHKI 
QLCSSAEVDM 
EYQEEMKAHY 
I 20 



I 30 
LTRATQEEVA 
YSWFFFDVLI 
RGFVFKQIHN 
L.REVGTALOE 
ES1ALPAVNP 
HQOSSTIXSNS 
GUGPIVHDRK 
KNQLiADHGH 
EASQLLYFLM 
IRTVmATAQ 
ROGCTAFRVI 
ETTTKEPKLT 
RRFMFEMPFT 
IKLQLKLQGS 
REMAJCELSEI 



I 



I <° 
VNVTRVIIHV 
KSMAQHLIEN 
YISCFAPGDP 
FREVRLIAIS 
LVTPQKGSTL 
WRCDFU3QS 
SQTL.PVSRNR 
NPLMKTVTDV 
RNNFDYTGKK 
MKEHENDPEM 
TPNIDEEASM 
P1>SEISQR1X 
OTGTRQGGVE 
VSVQVNAGPL. 
MHEQICPLEE 



30 



I 



I 50 
VAQCHEEGLE 
SKVKLLRNQP 
KTLFEY1CFEF 
VLKNLLIKHS 
DNSLHKDLLG 
EIKSLLMCFL 
TGMMHARUQQ 
YLCFLQKHQS 
SFVRTHLQVI 
LVDLQYSLAX 
MEDVGMQDVH 
KXYSDKFGSE 
EQCKRRTIL.T 
AYARAFIJJOT 

rrsvwNsm 

I 50 



I 60 I 7 ° I 80 

SKUISYVXYA YTAEFYVASE YTOVHEELTK 80 
FPASYKHAAE TWNMLMPHI TQKFGDNPEA 160 
URWCNHEHY IPLNLPMPFG KGRIQRYQDL 24 0 
FDDKYASRSH QARIATLYLP LFGLLIEHVQ 3 20 
AJSG1ASPYT TSTPKINSVR NADSRGSLIS 40O 
YILKSMSDDA UTYWNKAST SELKDFFTIS 4 80 
LgSLDHSLTF NHSYGHSDAD VUiQSLLEAN S6Q 
ETALKNVFTA UU5LIYKFPS TFYBGRADMC 64 0 
ISVSQLXADV VGIGETRFQQ SLSIINNCAN 72 O 
SYASTPEIJ*. TWUDSMARIH VKNCTLSEAA BOO 
^CTVLKELL EQCADGLWTA ERYELIADTY 880 
NVKMIQDSGK VNPXDLDSKY AYIQVTHVIP 96 0 
S"pW RIPVMYQHHT DUNPIEVAID 104 0 
£S£p£S KLLKEVFP-QF veacgqa^v 1120 
IFHAISGTPT STMVHGMTSS SSW 1194 
| 60 I 70 I 80 



I 10 I 20 | 30 I 

AATTGTAATA CGACTCACTA TAGGGCGAAT TCGCTACCOG ta-rJv^J^ 1 60 I 70 I BO 

. GAATTCGGCA CGACTTTTAC ACCATCACCA AAACCCAGAA ^E^IS GAGCT< ^CG GTATCGATAA GCTTGATATC «o 

. ATGAAAAGCA CCACCTGTTG CTCACATTCT TCcSS^ SSSIS™ H*™*"" AGAGTIGCCC ACTCAGCTGC 

I GTCGTIGAAA CCCAACTTGG CTACTCCTQG CT^^ TCAAGTAAAG GAAGCACGAA GAAGA^^ 

1 GCTCTCGGCG AACCTTCCTT CGGCCTATCT AAGGCTG C TC ACAAGCGAGC AGoSSS£ 2< ° 

1 G ^f^? AGGCAAGC " CTCCTCAAAA Tr£5S£ T^£^ "TOGCAGGCA TTATOGTCCG GAAaSSS 

1 ATTTTTTCCA GTACTCTCAG AAAACCGAAT CTGGAGCCCA IS^ST"* ACAGGGA TAC TCAGGATCAG CATTTACA^I ^ 

1 CATGCGATGG AAGGCCACGT GATGATCGCC TrSS ^23^ TAAAGTACCT TaISSSS iSS 

1 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT ^S^S^ CCAGCTGTTC CGAGTCCTCA CCAOAG^ ! *° 

1 ACTTGAGGTC ATATCTTAAG TACGCGTATA A^rJT^ CATGTGGTTG CCCAGTCCCA TGAGGAAGGA TTC*^^ C 640 

U ACCAAATCCA TGACCACGAT TCTCAAG^ ATATCI " ro CC TCTGAATACA AGACACTcS ^2^^ ?2 ° 

81 CTTTGATGTA CTGA7TAAAT CTA^SS ^TCACCAG CAACAAACTA CTGAoSaS »<»0 

€1 CTGCATCCTA TCATCATGCA GCGGAAAOOG SJSSSS S^S^ GCTGCGAAAC" CAGAGATT't3 88 ° 

♦ 1 GAGGCATCTA AGAACGCGAA TCATAGCCTT OC^^I SST^^^ CACATCACTC AGAAGTTTGG AOA^^ 960 

21 CAAGCAGATC • AACAACTACA TTAGCTC7TT TCCTCCTTian J^S* 6 *" 5 TITCACCrrC ATQGACAGGG QcnTrirw 10 "° 

01 GTGTAGTGTG CAACCATGAA CATTATMT^ CCCT ^™^ ATACAAG^ GaIS^I , 1X2 ° 

:61 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCR r^I^ TTOGW ^ GCAGGATTCA 1200 

*" ^^TCC GGGAGGTCCG C™*™** CTGTTACTGA GGGAGGTGGG ^ 

.41 ATGACAGATA TGCITCAAGG AGCCATCAGG CAAGGATAGr r^^ 00 TCAAGAAC " GOGATAAAG CWTTnrS 60 

S21 CTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCT^^ ^CCCTCTAC CTGCCrCTCT TTGGTCTGCT GA^HIT? l4 «° 

CM ACCAGCTGTG AATCCGCTCG TCACGCCGCA SSgGAAgI I^^** 3 C9a ^«= a jQ AAGGATGAAT ^0^5 1520 

681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAA^S^f ^^^^ *C*GCCTGCA CAAGGACCTC CTOG^S^T 16 °° 

761 ATAAGCACAG ATTOOQGTAA CAGCCTTCCA GAA^S^ ^JCAACACT GTCAGAAATC CTGA3TCGAG AoS^^ 1680 

841 CACATTGGGA AATTCCGTCG TTCGCTGTGA T^^I* GTC *GAAGAG CAATTCCCTC GATAAGCACC A^T^T^ X7 «° 

«1 TCTTAAAGAG CATGTCTGAT GATCCT^ ^^ZT^f CAGTCTGAOA TTAAGAGCCT ACTGATGTGT T^^T^ "«0 

'001 ATATCTGAAG TCTOCCTGCA CCAG^S ^A^GGA ^AACATCIG AACTTATOGA S^S^ " 2 ° 

;061 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTO^ AGCGATACAT AGCCAGGAAC CAGGAGGQGT TGQGAf!r^»> 2 °°° 

Jl«l GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAG^S ^^^CA TOC^GATTC CAO^SS^ 2080 

2341 GCCAACATTG CTACTGAGGT TTGCCTGACA GctSI^ ^ CACTCGSA C OCAGATGTTC TGCACCAGTC 2160 

2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT «?^OVTIG GOTTTTAAGA aSgS^ 2240 

2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA OOIW^ii ^^ATTTTCTACC T GTGT TT TCT TCAAAAA^AT ^S^T^ 2320 

24.1 ATGTCTGCGG CTCTGTGTTA CGAGATTC^ ^^tl TTATAAGTTT CCCTCAACAT TCTATGA^ S^SS^ 2 *°° 

25«1 GCTCTACTTC CTGATGAGGA ACAACTTTCA ^CTCCAAGCT GAGCTCCATC AGGACOGAOG 2480 

2«1 CTGTCAGCXA GCTGATAGCA GACGTT^TTG TTGTCCGGAC ACATTTQ^AA G^SSSS 2560 

2721 GCCAACAGTC ACCGGCTTAT TAAGCACACC ^GCTTCTCOT ^E^^ CAGCAGTCCC TGTX^ATCA? Sa^SS^ 2640 

2801 AATGGCCACC GCCCAGATCA AGGAGCATCA GAACGAcSrt P^TOTGAA GGACTTAACC AAAAGGA^ S^SSI 2720 

28B1 ATGCCAGCAC GCCCGAGCTC AGQAAGAGCT G^^S ^CCTCCA G^ACAGCCTC G^S^ 380 ° 

29€l GCAGCAATGT GCTATGTCCA CGTAACAGTC CTAG^G^ f*?** 10 ** 3 ATCCA-TOTCA AAAMGTOGA S^I^I 2880 

3041-CGCCrrCAGG GTCATTACCC CAAACATTOA ^ATATCTCAC ACGGAAAGGC GTGTl^AGAC AAgSS^ 29S0 

A 5SJ G 5 ATCT GCTCATOGAG CTCCTTGAOC AG^^f ^^^0 AAGACGTGGG GATGCAGGAT G^SS^ 3040 

3201 ATCTACAAAC TTATCATCCC CATPrATOAG AaSSS^ ^f!^ 00 **AGCCGAGC GCTOCGAoS S^^^T^ 3120 

3281 GGAACCCAAA CTCACACCGC TGTOGGAAAT ATIT1 ™ AGATGAAGAT GGAAAGGAOT S^?^ 3200 

T 5 AA ** TC * r ACAGGAITCT ^GCAAGGTCA Hc^stt ^™°A TAAATTTOCT 3280 

3441 AT CC C CITCT TTGACGAAAA AGAGTIgSa gSI^^ J^^H^ ^^^M ACATCX^OOT GA^^S 33 «° 

3521 TGAGATGCCA TTTACGCWSA CCGGGAAGAG GCaS^ ^gy mTCR «W»TCCCAC AACATCCOCC G^S^S 3440 

3«01 TACACTGCTT CCCTTATGTG AAGAAGCGCA GTOGAAGAGC AGTCCAAACG GCGCAcStC SS^^T 3520 

3«1 ATTGACGAGA TGAGIAAGAA GGTQGCGSAG S^Scl^ ^^ACC TGAAcSS SaG^^ 3C °° 

37S1 CAAACTCCAG GGCAGCGTGA <nGTTCAGCT CAaS^ l^"^ GOCC ^^ GACATOA^ AA^S^S 3 "° 

I*'"; CA * A ff GA ' IA TCCTGACAAT AAAG1GAAGC f^AGCAT ATGCGOGAOC TTTCTTAGAT 0^5^^ V'° 

3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTO^ ^ZT™* 0 * ^^rTGTCG AAGCTTOOOG TC^^^ 3840 

4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT Sg^^ ol^^ ^TOAAAOCC AACTaSgQ^ AAA^GO^ 392 ° 

4081 TCAACGCCAT CAGTGQGACT CCAACAAGCA S^S^I^ GAGGAGAAGA OGAGCGTCTT ACCGAATTCC rrJ^ 0 ^ *° 00 

4161 ATGGCCCGTG TGTGGGOfiCT TOCTTTCTCA TTTG^AAii^T S^^^ AGCT C P TC 7T CGG^S 4 ° 8 -° 

4321 AGAAGGTGCA CATATTXTTT TAAATCTCAC ^GGCAATATT £*f ** CGT " 1 "TCTTAACA GACTTTCTAT AOGACT^ 42 "° 

44M TCTTGAGCTO GACnAGATT TTATTCTTCC T^SgAG^I ^ff 1 ™ A^GTGTCTT AACAAAOGTG TO^S 4 f " 

4481 ATCTACATGG CAGGGAGOGC TGCACTGACA TOM^ <STBTTMSWT AGATGGCCTA CAGAAAAAAA AGCTtS^ 4400 
4561 AATCAGGGTA CAGAACT7AC TAGtStgTC WGGAG^ATG ^^^^ CGTGC^ ATCTGATOTT 

4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGaSS^ "^TATGACT AGGATTTGTG CTATTATCTC ATTCA^^ 4S6 ° 

•4721 TCATOGACTC CGGGAOGGTC ATATATO SS^Sa ^I^^ ^CnGGCTTCA 7U3TCAGAACT SSSrt^ 
4801 TGTGATAAAT TTCnGCTGGT CCAGTATATG Sa^SS ACTCACATGG GC1TATGCAT SSSSS till 

4881 TOTATACAAG TCTTTACT ^iACACTT TAATGGTTTA TTCTTCTCAT AAAAATGTCC Xnrl-^C^ 4800 

I 10 I 20 | 30 I 40 , ^raroOAGA.4 B80 

I 50 , 60 , , 0 ^ 



Fidf. IDC 



I 10 I 20 

KEGHVMIAFL 

SKNANKSIAV "^CFTFKD 
nLDY SLTDEF CRNHFLVGU. 
RIN VRDVSPF F^ 1 ^™ 

lATEVCLTAL ^TI^UFTIAF 

i SDRXiI KHTSF SSnVKDLTKR 
MCYVHVTALV WnTVTRKGVF 

1 FFDEKEliQER KTEFERSHNI 

1 NERUKEDQt. EYQEEHXANY 
| 10 I 20 



I 30 
LTRATOEEVA 
XSWFFTDVL.I 
RGFVFXQITCN 
UIEVGTAIQE 
ES1AUWHP 
HQQSSTLGNS 
GLGPIVHDRX 
KNQL1ADHGH 
EASQLLYFLM 
IRTV1MATAQ 
RQGCTAFKVI 
EYXYXEPKLT 
RRFMFEMPFT 
IKLQU3JQGS 
REHAJOELSEI 
I 30 



I «0 

VNVTRVIIHV 
KSMRQHL.IEN 
YISCFAPGDP 
FREVRLIAIS 
LVTPQKGSTL 
WRCDKLDQS 
SQTUVSKNR 
NPIMKKVFDV 
RNNFDYTGRX 



TPNIDEEASM 
PLSEISOKLL 
QTGKRQGGVE 
VSVQVNAGPL 
MKEQICPLKK 
| 40 



I 50 I 60 I 10 I 80 

VAOCHEEGLE EHUUJYVXW YKAKPYVASE YKTVKEELTK BO 
SFVnJjRNQR FPASYKHAAE TVVMMU4FHI TOKFCTNPEA 160 
KTUFEYCTEF LRWCNHEHV IPLKLPMFFG KCRIQRYOTL 240 
SuaSllMS FDDRYASRSH OARXATLYLP UdXIEKVQ 320 
SsSroUUS AISGXASPYT TSTPNIHSVR KADSRGSLIS 4 00 
EIXS1XMCFL YILXSKSDDA UTTfWOTWST EE1XDEFTIS 4 80 
TWWAR1QQ IXJSLBNSMT MHSYGHSDAD VU10SLIJ*K 5*0 
YLCFUJKHOS ETAUOWPTA l*SIJpr*FFS TFYBGRAEMC 640 
SFVRTHWIVI ISVSQUIADV VGIGETPJQQ SLSIHTOCAM 730 
LVDUQYSLAK SYASTFELRX TWUDSMARIH VKNGDLSEAA 8 00 
KEDVGMQDVH FNEDVU4EIX EQCADGl>OCA KPYSULADIY 880 
KXYSDKFGSE NVEMIQDSGR VNPKDLDSKY AYXffVTHVlP 960 
EOCKRKTILT AIHCFPYVKK RIPVKYOKHT DUTCEVAII, 104 0 
AYARAFUJOT NTTRYPDNKV 

KTEVUFNSXii IFHAISGTPT STMVHGKTSS SSW 1194 
| 50 | 60 | 70 | 80 



2161 
2241 
2321 
2401 
2481 
25€1 



AATTGTAATA 
GAATTCGGCA 
ATGAAAAGCA 
GTCCTTGAAA 
GGTCTCGGOG 
GGGTAGATGG 
ATTTTTTCCA 
CATGCGATGG 
ACAGGAAGAA 
ACTTGAGGTC 
ACCAAATCCA 
CTTTGATGTA 
CTGCATCCTA 
GAGGCATCTA 
CAAGCAGATC 
CTGTAGTGTG 
GACCTCCAGC 
GACAGCCCTC 
ATGACAGATA 
GTCCAGOGGA 
ACCAGCTGTC 
TCTCCGGCAT 
ATAAGCACAG 
CACATTGGGA 
TCTTAAAGAG 
ATATCTGAAG 
AGTTCATGAT 
GCAGCCTGGA 
GCCAACATTG 
GGCCGACCAT 
CQGCTTTAAA 



3121 
3201 
3281 
3361 
3441 
3521 
3 €01 
3681 
3-J61 
3«4l 
3S21 
4001 
4081 
4161 
4241 
4321 
4401 
44B1 
4561 
4641 
•4721 



GCTCTACTTC 
CTGTCAGCCA 
GCCAACAGTG 
AATGGCCACC 
ATGCCAGCAC 
GCAGCAATGT 
•CGCCTTCAGG 
ACGAGGATGT 
ATCTACAAAC 
GGAACCCAAA 
TCAAAATGAT 
ATCCCCTTCT 



TACACTGCTT 
ATTGACGAGA 
CAAACTCCAG 
CAAAGCGATA 
GCGGTAAACG 

GGAGcnrrcT 

TCAACGCCAT 
ATGGCCCGTG 
CAGGGAGGAC 
AGAAGGTGCA 
TCTTGAGCTC 
ATCTACATGG 
AATCAGGGTA 
ATAGAGCAAG 
TCATCGACTC 
TGTGATAAAT 
TOTATACAAG 
I 10 



I 20 I 30 I 40 

CGACTCACTA TAGGGCGAAT TGGCTACCGG 
CGAGTTTTAC ACCATCACCA AAACCCAGAA 
CCACCTGTTG CTCACATTCT TCCATCTCAG 
CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
AACCTTCCTT CGGGCTATCT TGGCTACCAA 
AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
AAGGCCACGT GATGATO3CC TTCTTGCCCA 
GTCGCGGTTA ACGTCACTCG GCTCATTATT 
ATATCTTAAG TACGOGTATA AGCCTGAGCC 
TGACCACGAT TCTCAAGCCT TCTGCCGATT 
CTGATCAAAT CTA7GGCTCA GCATTTGATA 
TCATCATGCA GCGGAAACOG TTGTAAATAT 
AGAACGCGAA TCATAGCCTT GCTOTCTTCA 
■ AACAACTACA TTAGCTGTTT TGCTCCTGGA 
CAACCATGAA CATTATATTC CGTTGAACTT 
TTGACTACTC ATTAACAGAT GAGTTCTGCA 
CAGGAGTTCC GGGAGCTCCG TCJGATCGCC 
TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
TCAATGTGAG GGATGTQTCA CCCTTCCCTO 
AATCCGC7GG TGACGCCGCA GAAGGGAAGC 
TGv.iiT.iCCA TATACAACCT CAACTCCAAA 
A TTCGGgTA A CAGCCTTCCA GAAAGGAATA 
^JS^H 30 TTOGCrCT1 ^ TAAACTTCAC 
CATGTCTGAT GATGCTTTGT TTACATATTG 

tctgcctgca ccagttccag tacatogqga 

CGAAACTCTC AGACATTGCC TGTTTCCCGT 
TAACTCTCTC ACTTTTAACC ACAGCTATGG 
CTACTGAGGT TTGCCTOACA GCTCTOGACA 
GC ACATAAT C CTCTCATGAA AAAACTTTTT 
**^GTOTC ACTGCCTTAA GGTCCTTAAT 
CTCTGTSTTA CGAGATTCTC AAGTGCTGTA 
CTGATUAGOA ACAACTTTGA TTA^S 
GCTCATAGCA 6ACGTTGTTG GCATTGGGGA 
ACOGOCrrAT TAAGCACACC AGCTTCTCCT 
GCOCAGATGA AGGAGCATGA GAACGACCCA 
GCCCGAGCTC AGGAAGAOBT GGCTCGACAG 
GCTATGTCCA CCTAACAGCC CTAGTGGCAG 
GTCATTACCC CAAACATCGA CGAGGAGGCC 
GCTGATGGAG CTCCTTSAGC AGTGCGCAGA 
££1^1^ ^"TATOAG AAGCGGAGOG 
CTCACACCOC TGTCGGAAAT TTCTCAGAGA 
ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
TTIACGCAGA COGGGAAGAG GCAGGGCGOG 
CCCTTATGTC AAGAAGCGCA TCCCTOTCAT 
TGMTOAGAA GGTGGCGGAG CTCCGGCAGC 
OGCAGCGTGA GTGTTCAOGT CAATGCTGGC 
TTXTCACAAT AAAGTOAAGC TT5CTTAAGGA 
AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
GAAATCATGC ATGAGCAGAT CTOCCCCCIG 
CAGTWJGACT CCAACAAGCA CAATGGTTCA 
TGTGGGGACT TGCTTTCTCA TTTGCAAACT 
CAAGG3GAAG GGGAGAGAAA GGAAATAAAG 
«2liA m TAAA TCTCAC TGGCAATATT 
GACTTAGATT TTATTCTTCC TTGCAGAGTA 
CAQGGAG6GC TSCACTGACA TTGATGCCTG 
CAGAACTTAC TAOTT TTGTC TAGGAGTATC 
AATAGTGAGC TAACTGAGCT AGACACTCAA 
COOW:a ° 1X: ^TATATCTAT TACATTTCTA 
TrGTCC TO CT CCAGTATATG CAATACACTT 
TV j 1 TACT 

I 20 I 30 | 40 



I 50 
GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGCTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTG 
ATATGTTCCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TGAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCT I rCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTOa 
CATOGCCAGC 
AATATCTCAC 
TCCATGATQG 
TGGACTCTGG 
ATTTCTTTGA 
CTCCTTAAAC 
TCTQGATTCT 
CAGAGTTTGA 
GTGGAAGAGC 
GTACCAGCAC 
TGTGCTCCTC 
C CACT AOCAT 
AGTTrrCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATGACC 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTTC 
6TGTTAGAAT 
gGGGACCTTT 
TTGTATGACT 
TTAATCOGCT 
CAT rTTTAAT 
TAATGGTTTA 

I 50 



| 60 
GAGGTCGACG 
AGATTAAAAT 
TCAACTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACXaVACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCTCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 
CTGCCTCTGT 
CATGACC G T G 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATCTG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 
TGTGTTTTCT 
CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTOGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTG 
ATGCGCGAGC 
CAATTTGTGG 
AATGAAAGCC 
CGAGCGTCTT 
AGCTOGTCTT 
TCCAAAGCCA 
TTTCTTAACA 
ATTGT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 
TTCTTGTCAT 



| 66 



I 70 I 80 

GTATCGATAA GCTTGATATC 8 0 
AGAGTTGCCC ACTCAGCTOC 160 
GAAGCACGAA GAAGAGGGAT 240 
j^GOGAGC AGCACATCCC 320 
TTATGGTCCG GAAATTAAAT 4 00 
TCAGGATCAG CATTTACATA 4 80 
TAAAGTACCT TAAGAGTX^S 560 
CSA^TCCTCA CCAGAGCCAC L o 
TGM5GJ ^«» TTOGAGAGCC 720 
^CAOTCCK TGAAGAACTG 800 

^^*^*AAAC' CAGAGATTTC III 
AGAAGTTTOG AGATAATcS 104 0 
ATOGACAGGG GCTrTOTCTT lljn 
ATACAAGTTT GAATITCTCC 12O0 
OCAGGATTCA AAGATACCAA llll 
CTGTTACTGA OGGAGGTGGG llH 
GCTGATAAAG CATTC1TTTG 14 H 
"ITOGTCTOCT GATTGAAAAC 1S20 

CA * QG *CCTC CTOOGCGCCA 1680 
CTS « I TCGAG AGGATCTCTC 17«o 
GATAAGCACC AACAAAoSS lelo 
ACTGATGTGT TTCCTCTACA llH 
AACTTATGGiA TTTTTTT'ACA JOOO 

TGCCAGA TTG CAOCAGCItaa 21«n 

COOTTrAA£ » ACCAOCTXXn- 232o 
TCAAAAACAT CAGTCTTGAAA 240O 
TCTATGAAOG GAGAGCGGAC 24 n 
AOGAC «»«S CCTCCCAOCT 2S60 

sssss-sssssiS 

^CAOGAT GTCCA7TTCA 312o 
OCT « X »OCr CATCOCOGAC 32QO 
° GAAAOG *«' ATATTTACAA 32BO 
TAAATTTOGT IXnGAAJ^S 33 « 
ACATCCAGGT GACTOWCGTC lltl 
^TCCGCC QCTT^ATGTT 3520 

^^^X^.OGAGGTOgS 36« 
«CATCATCA AA^CaS 3760 
"TCTO^OJtt GATACAAACA 38 4 0 
tt^™ 0003 TCAAGCCTXA 3920 
^^OLOGG AAATGGOGAA 4O00 
ACOGAATTCC CTTCACATCT 4O80 
ATTA CATCTC 4160 
A T2 fc £r?2? S ^«»CCGAOCA 424 0 
GACTTTCTAT AQGAGTTGTA 432D 
AACAAAGGTG TOGTAGACAC <ll 0 

CCTGCCGGAA ATCTGATCGT 4 560 
CTMTATCTC ATTCAACAAC 4640 
^f^^CT TTGTCATTAA 4720 
GCTTATOCW TAAGTTTAAT 4 800 
*AAAATGTGC AATATGGAOA. 4880 
4898 



I 



I 



i „ n I 30 | 40 I 50 cLvvtva -yKAEFYVft^E 'FKTVHEEIjTK. BO 

I *° _ T ' QLF iv fcTRATQEFVA VKVTFVIIHV VAOCKEEG1JE S"^£££ TVVNKl^PHI TQKFGDKPEA 160 
KEGHVMIAFL ^^f^. "sv^FFDVLI KSHAQHLIEN SKV^J^ 1PU iLPMPFG KGRIQRYQOli. 240 

SKHAWiSlAV ^^ET?T^ SUctAWE FKEVKUAIS VUWUJWS TO£J|J£J ^WCTSVR HADSKGSMS 400 

QLXIYSLiTDEF C^K^VGl^ ££IxPAVNP LVTPQKGSTL »««I»UB UTYHWKAST EEUfflriTIS 4 80 

RINVWJVSPF FVNA ^Irr; HOOSSTUGNS WRCDKLDQS EIXS1AMCFV t^svcKSDM) VLHQS1XEAH 560 

TOSGHSLPE* NSE^SI^^ ^P^™ S^PVSRHK TGM«H^ SK™*** TFYEGRAEHC 640 

EVCUHQFQW GKR^^JJ ££u*WCai NPU^rVFUV VL^KHQS ^SxRFQQ 720 

AAUCYFIU^ CNSW^SICT jfKEHENDPEH ^D1^SU« EQCADGL>TKA KRYEI-XJUJIY 8 80 

; s^S sssss ssss ss Ss ssss sees t" 



I 



AT7T7TAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG err. . . . I ™ i 70 | B0 

AATTCCGCA CGAGTTTTAC ACCATCACCA AAACCCaga* ^^31 GAGGTCGACG GTATCGATAA GCTTGATATC 8 0 

TCAAAAGCA CCACCTGTTG CTCACATTCT TTCATCTCAG CT^I!^ *GA™AAAT AGAGTTGCCC ACTCACCTac 160 

.TCGTTGAAA CCCAAGTTCG CTACTCCTGG CTTCCCcSc ^T^^ I^^S^S CWCCADiAA CAAGAGGGAT 240 

ctctosgcg AAccrrecrr cgggctatct tcgctSS S^^f ^^ GTGCTG ^scxacc agcacatccc 320 

JGGTAGATGG AGGCAAGCCA CTGCTGAAAA TOCaSS T^T^ T ™ rOCTCC 0 CAAATTAAAT 400 

vTxrrrrccA chactgtcag aaaaccgaat ct«SSS acagggatm: tcaggatcao oo-ttacata <bo 

rATGCGATGG AAGGCCACGT GATGATOGCC £££££ AACCAACTTG TAAAGTACCT TAAGAOTCTt! 560 
=.CAGGAAGAA GTCGCCCTTA ACGTCACTCG GStSttaTt" CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 6« 

&CTTGAGGTC ATATGTTAAG TACGCGTATA SgSgIgS £££££ TCAGGAAOGA TIX^AQAOCC ^0 

ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTCCTOATT TCTGWACA AGACACTGCA TGAAGAACTG BOO 

CTTTGATGTA CTGATCAAAT CTATGGCTCA OcSSSS SSIS^ ^* ACTA CTCAOCTACT ^80 
CTGCATCCTA TCATCATGCA GCCGAAACCG TTCTAAATAt" AAGTTAACTT GCTGCGAAAC' CAGAGATTTC 

GAGGCATCTA AGAACGCGAA TCATAGCCTT GCT^JrrS T^AG^ "^^nTX AOATAATCCA 

CAAGCAGATC AACAACTACA TTAGCTC1TT TGCTCCTGGA GA^S^™ TZ^EH? ATTC *CAG«= GCrTTCTCTT 

U IUiAUiUi u CAACCATGAA CATTATATTC A^SS^ CCCTCTITOA ATACAACTTT GAATTTCTCC 

GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA rf^^ TTTSGAAAAG °CAGGATTCA AAGATACCAA X2BC 

uacagccctc cacowtttcc gcgaggtccg tctcaSS at££^£ crcrrrACTGA ajGAGcraoS llll 

ATGACAGATA TGCTTCAAGG AGCCATCAOG CAAGGATaS rl^SS^ J^^L GCTGATAAAG CATTCTTTTG l^n 

GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTtcSS ^r^S^ CTG C C I CTGT ITWfCltJCr GATTGAAAAC llll 



960 
104 0 
1120 
1200 
1280 



v^^mv.ow juwrerew GGATCTGTCR CCCTTCCCTG ^.-^ITVZ nwinutr GATTGAAAAC 1S20 

ACCAGCTGTC AATOCGCTGG TGACGCCGCA GAAGGGAAGC 1^^? CATGACCGTG AAGGATGAAT CCCTOGCTCT ISOn 

TCTCCGGCAT TGCTTCTCCA TATACAACCT rt^^^T AOCCTGGACA ACAGCCTGCA CAAGGACCTG CTOGGCG«£ ,f 2 

ATAAGCACAG A^cloGTAA* CAGCC^TCcI GAAAGGAATA GTGa££S CTG ^ TC CTGATTCGAG AGGAT^S SS 
CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC CWCPPCCCTC GATAACCACC AACAAAOTAO 1. 

TCTTAAAGAG CATGTCTGAT GATGCTTTOT 7TACATATTG SSSS^ 1™°"*** ACTGATGTOT TTCCTCTaS 2 
ATATCTGAAG TCTCCCrOCA CCAGTTCCAO TACATTO^ ^CAACATCTG AACTTATGGA TXTXTTTACA 

ACTTCATGAT CGA AACTCT C AGACATTGCC TGTItSot ^^I^^ AGCCAOGAAC CAGGAGGOST TGGGAC^ 

• CCAGCCTGGA TAACTCTCTC ACTTTTAACC aSg^^^ ^5^5^ TCCCAGATTQ «OoScTTO 

^ GCCAACATTG CTACTGAOGT TTGCCTGACA GCTCtSaS ™* CmX * C OCAGATGTTC TGCACCAGTC ATTACTTGAA 

I GGC CGACC AT GGACATAATC CTCTCATCAA AAAACTTm SS^TS ATTTACATTG GOGTTTAACA ACCAOCTCCr 

1 CCaCITTAXA ^^TT C ACTGCCTTAA GGtStIS TT^SS 1 i ^ TCAAAAACAT- CI^TGAAA 

1 ATCrCTGCOO CTCTGTGTTA OGAGATTCTC AACTGCTO^ ^^ATAAGTTT CCCTCAACAT TCTATGAAOG GAGAGCXSQAC 

1 GCTCTACTTC CTGATGAGGA ACAACTTTSA mS AAgII^S SSS^" AroAOTOW « ^^0^ «C0 
1 CTGTCAGCCA GCTGATAGCA GACCTTGTTG GCATTOGGGX TrCTCTOQW: ACATTTOCAA OTCATCA»a- 2s!o 

1 OCCAACAGTG ACCGGCTTAT TAAGCACACC CT^S^ CAGCAGTCCC TGTCCATCAT CAACAACTGT IVil 

1 AATGGCCACC GCCCAOATGA AGGAGCATW GAACGACCCA Sr^^^ ^"^^ AAAAGGATAC GCACGOTOCT 2BOO 

1 ATGCCAGCAC GCCCGAGCTC AGGAAGAOGT GGCtIgaS CAT^SS T^™^ ^CAOCCTG GCCAAATCCT 
; 1 CSCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTC^ ATCCATGTC * AAAATGOOGA TCTCTCAGaS llln 

a "Cgccttcagg gtcattaccc caaacatcga cgSaISc t^S^^ acggaaaggc gtgtttagac aagga^ca? llll 

a ACGAGGATGT GCTGATGGAG CTCCTTGAOC ACTGOG^ Z^T^I^ ^W^aJTGGG GATGCAGGAT GTCCA3TTCA llto 

>1 ATCTACAAAC TTATCATCCC CATTTATGAG AAGC^S It^SSS AAACCCGXGC GCTAOGAOCr CATCG^GAC 3200 

11 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGi SSSH^ AGATGAAGAT GGAAAOGAGT ATATTTaSX 32B0 

SI TCAAAATGAT ACAGGATTCT GGCAAGGTCA aISt^ SSS^ SSS^'SS22!S- !~2^" 5SS 



1920 
2000 
2O0O 
2XSO 
2240 
2320 
2400 

24 BO 

25 CO" 



" TCMAXTCM - ACAOGATTCr GGCAAGGTCA ACCCTAAGGA tTS™ T^J^ T ^" TOC5r - ««»AAATa 

«1 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAA^AAAA rS!3! ^ X ^ tQCKT ACATCCAUUT GACTCACGTC 

21 TGAGATGCCA TTTACG CAGA CCGGGAAGAG GCAGGgSS GTGgSSS ^f^^ AACATCCQCC GCTTCATGTT 
" CCCITATGTG AAGAAGCGCA T^GT^ 



3440 
3520 
3600 
36B0 
3760 
3840 
3920 
4000 
40B0 



61 CAAACTCCAB GGCAGCGTGA GTGTTCAGST CAATGCTGGC GGCCGAGGTG GACATGA TCA AACTOCAOCr 

• 41 CAAAGCGATA TCCTGACAAT AAAGTGAAOC TGCTTAAGsi f^"^ 3 ? ^ ATG ^GCGAGC TXTCTTAGAT GATACAAACA 

• 21 GCGGTAAAOS AACGTCTGAT TAAAGAAGAC CAGCTCGACT iSTS^^ CAATTrGTGG AAGCTTGCOG TCAAGCCTXA 
101 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCOCCTC ^1^^ MTaMSCC AACTACAOGG AAATGGCGAA 
J81 TCAAOGCCAT CAGTOGGACT CCAACAAGCA CAATGGTtS CG, ^ m " ACCCAAXTCC CTTCACATCT 

ATGGCCCGTG TGTGGGGACT TGCTXTGTCA TTTGCAAACT "'^'^^ KC ^^ 

J41 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG S^^SI TC °° UiGCC * *T CACTGGG O AGACCCACCA 

121 AGAACGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT ctS^S^ TTTCTTAACA GACTTTCTAT AGGAGTTGTA 

401 TCTTGAGCTG GACTTAGATT TT ATTCTTCC TTGCAGAGTA S^I^ ^1!^?^ AACAAW3CTG TOGTAGACAC 

481 ATCTACATOG CAGGOAGGGC TGCACTGACA TTGAtSSS i^SS"* CWSM * WR *OOTTCTGOG 

5S1 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTAtI TT^^Z CSS ™= a * S ** *TCTCATCOT „ r , 

<41 ATAGAGCAAG AATAGTGAGC TAACTGAOCT AatSSSS ^ rTTOTG CTATXATCrC ATTCAACAAC 4640 

721 TCATCGACTC CGOGACOGTC ATATATGTAT TACATTTCTA CATT^^S A "^TCA AGTCAGAACT TTGTCATTAA 4720 

801 TGTGATAAAT CCAGTATATG CAWACAcS T^T^ ! ff^S^ GC3TATCC *r TAAGTrTAAT 4800 

- rTACT ^xACAcrr taatggttta ttcttct c at aaaaatgtcc aatatgoaca 



4400 
4480 
4560 



881 TGIATACAAG tctttact 

I 10 I 20 | 30 | 



I SO | 60 | 70 | 80 



4880 
489B 



U*£ fixxJ^S. v^xii v*^* 5. 

DFi.TSnoOO* YSXFFTOVLI XSHfcQHLIEH f*™^"g* Iw*£kW« XGRIOKTODI. 240 

SsL»v fixrctitud rgfvfkqxhn yiecfapgdp rri^xrKFEF ^™=™ty J^SSx* upcuotvq 320 

CRNHTLVGlii iWVGIWflE FRTVRUAIS ™O^WS ?^ISsVR *ADSRGSUS 400 

SiSS™=E ===== 



Fldl. IDE (to*} 



I 10 I 20 | 30 I 40 I 

^ATTGTAATA CGACTCACTA TAGGGOGAAT TXJGGTACCGG nrr^L^Jt! I 60 \ 10 | 80 

IAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA GAGCTCGACG GTATCGATAA GCTTCATATC 60 

VTGAAAAGCA CCACCTGTTC CTCACATTCT TtXATGTCAG S££IST2 f^"^" 1 " W»GTTOCCC ACTCAGCTCC 160 

=TCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCcS^ TcSSS^ ^^"^ ^^ACGAA GAAGAGGGAT 240 

GGTCTCGGCG AACCTTCCTT CGGGCTATCT T^GCTACcS S^^f f^"*™ AC ^?°^ SC *=CACATCCC 320 

GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA S^^S TGGGC * GCC * TTATGGTCCC GAAATTAAAT 400 

attitttcca gtactgtcac aaaaccgaat ctcgagtcS aSSS? acagggwcac tcagcatcag catttacata 4 80 

CATGCGATGG AAGGCCACGT GATGATCGCC TTCnScCA* AACGAACTTG TAAAGTACCT TAAGAGTCITS S60 

ACAGGAAGAA GTCGCGGTTA ACGTGACTCG StSSS St^S^ CCAGCTGTTC CGAGTCCTCA CCAOAOCCAC €4 0 

ACTTGAGGTC ATATGTTAAG TACGCCTATA AGCCTCAGCC ilW™^ ^ GTCCCA rc* 00 **^ TTOGAOAOCC 720 

ACCAAATCCA TCACCACGAT TCTCAAGCCT TCTOCcSS T^I^ TCT3AATACA AGACAGTCCA TGAAGAACTO 800 

CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTtStI SSS^ CAACAAACTA CTGAGCTACT CATOnTHT 6B0 

CTGCATCCIA TCATCATGCA GCGGAAACCO TTCTAAaS S^S^ AAGTTAAGTT GCTGCCAAAC- CAGAGATTTC 960 

GAGGCATCIA AGAACGCGAA TCATAGCCTT GCTCTTCTTCA ^S^I^ S*f*^*fg ^^^TTCG . AGATAATtaS 104C 

CAAGCAGATC-AACAACTACA TTAGCTGTTT TX^CTCC^S gSJSS^ T^^ Z t^^S ^^^TOTT H2Q 

GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTr AW^^T* CCCll'lTH* ATACAAGTTT GAATTTCTCC 120Q 
GACCTCCAGC TTGACTACTC ATTAACAGAT CIWTTTCTGcI "^G^^ GCAGGATTCA AAGATACCAA 1280 

GACAGCCCTC CAGGACTTCC GGGAGGTCCG TCTGAtSS "^^^ "CTTACTOA GGGAGGTGG* 1360 

ATGACAGATA TGCXTCAAGG AGCCATCAGG CAAGGaSS c££S^ ^G^CCT GCTGATAAAG CATTCTTTTG 14^0 
CTCCAGCGGA TCAATGTGAtt GGATCTGTCA CCCTTCCCTC CTGCCTCTGT TTGGTCTOCT GATTCAAAAC 152C 

ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC A^^S C * TGACCCTG **GG*TGAAT CCCTGGCTCT 1«S 

TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTcSS CAT^^ ACAGCCTGCA CAAGGACCTC CTGGGOJCC* JSS 
ATAAGCACAG ATTCQGGTAA CAGOCTTCCA GAAAGGAATA 



3.760 



TCTTAAACAG CATCTCTCAT GATGCTTTGT mcS Sfr^S TTAAGAGCCT ACTGATGTCT TTCrrCTACA 

ATATCTGAAG TCTGCCTGCA CCACTTCCAG TACATOGGgI AG^SS ^LACATCTG AACTTATOSA TTTnxnSl 

AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTTCCGT aSSTIS* AGCCAOGAAC CACGACOGCT TGGOACCCAT 
GCACCCTGGA TAACTCTCTC ACTTTTAACC AaScTATG^ g^^^S ^"^"O CAGoIo^^ 

GCCAACATTG CTACTGAGGT T^CC^ACA GCTCxSIS "^^^ I^fE^ ^ACT^ , 2 

I GGCCGACCAT GGACMAATC CTCTCATGAA AAAACmS UTCTFnrr ? T "* C * TTC SSf 1 ^"^ *0»«TOr 2330 

1 C °° CrrrRAA AAATGTCTTC ACTGCCTTAA GCTCCTXAAT ^tSSS r ' tT^j 1°^°^- CAGTCTOAAA 24oS 

1 ATCTCTGCGa CTCTGIGITA OGACATTCTC AAGTCC3CTA IctS^SI ^S^^ TCTATGAAGG GAGAGCOGAC 2480 



1920. 
2000 
20B0 
2XS0 
224 O 



1 CCTCTACTTC CTGATGAGGX ACAACTTTGA TTAotciGGA AAGAAOT^S- Sr^^^^^^" ^^CTCCCAC^T 2SC0' 

1 CTGTCAGCCA GCTGATAGCA GACGTTGTTC GCA^^J A t^S5^ ACATTTQCAA GTCAlS^ 

1 OCCAACAGTC ACCCGCTTAT TAAGCAGACC ACCTX^TcS ^S^If S^T^ TfT^"^ ^ACAACTOT 2720 



3760 



21 GOGGTAAAOS AACGTCTGAT TAAAGAAGAC CAGCTCGAGT ^^"^^ AABC7TWSG TCAACCCTSA 3920 

GGAGCTTTCT GAAATCATQC ATGAGCAGAT CTGCOCCCT^ GAGGAGAAS& CtlA^^r^T^^x A **"***^*'^ 4O00 

IB1 TCAACGCCAT CAGTOGSACT CCAACAAGCA CAA^rS CGGGATCAO^ ACCGAMTC C "TCACATCT 40.0 

.«1 ATG5CCOCTC TG7GGGSACT- TGCTTTGTCA TTTBCAAACT ^^3^55 A ° CJ * "CCGTCUJUIU ATTACATCTC 4160" 

141 CAGGGAGGAC CAAQGGGAAG GGGAGAGAAA 0011*1^ ItS^I Z^^* ATCRCTSCOC *«»CCCAGCA 424 0 

121 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT caT^SSZ^ 1 1 lEZ^E^ GACTTTCTAT AGGAGTTGTA 4320 

S01 TCTTGAGCTG GACTTAKAIT I U l l i ^-«-^,„ 7^"" A1TCTGTCTT AACAAAOGTG TGGTAGACAC 4400 



»01 TCTTGAGCTG GACTTAGATT TT ATTCTTCC TTGCAGAGTA GTCT^A^^ w!^" AA ^! OOTS ""Wl^CACAC 4 
.81 ATCTACATGS CAGGGAGGGC TGCACTGACA TTGATGCCTG G^I^M ff^?^ C********* AOCTTCTOQG 4 
^V*"?^ TAGTTTXGTC tII^S JtS^S ESSS^ ^Sf^ A !^A»OGX 4 



561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATC ^ j^fl CGTCCOQGAA ATCTGATCCT 

S41 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGaSctSS tS^^S ^T 1 ^^ CTATTATCIC ATTCAACAAC 

721 TCATCGACTC CGGGACGGTC ATATATGTAT TAoSrxSJ cSJttSS tS^S -™***™* 

801 TCTGATAAAT TTGT G CTGG T CCAGTATATG CAATACACTT ^IT^ ACTCACATG G GCTTATGCAT TAAGTTTAAT 

rATACAAG TCITTACT CAATACACTT TAATGCTTTA TTCTTGTCAT AAAAATGTGC AATATGOAGA. „. u 

• 10 I 20 I 30 I 40 I en I «-« ■ -««9e 



4480 

1560 
4640 
4720 



801 TCTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT TAAtS"£ GCTTATGCAT TAAGTTTAAT 480 

881 TGTATAGAAG TCTTTACT *-~-w=TT TAATGCTTTA TTCTTGTCAT AAAAATGTGC AATATGOAGA- 4 880 

I 50 | 



I 70 I 



10 ^TT.lm.T^ l/niATOE!VA VNVxiviIHV VAQcLeGIZ SKLRSYVKX* ™S?T!£^ ?JSSSS 160 

t-TTTRCTTrMD RCSrvncQlHM YISCFAPGDP TTVTErSTEF nARlATLVLP UGLLIEWVQ 320 

La^TT^OE rKTVKI^IS VL^^ HABSKGSUS 400 

^SffVKD ESl^P*™ LVTPQJ33STL m^UG £^SI UTWUKAST SEU4DFFTIS 480 

£se*shsi*k hoosstxghs WKCD^QS ^SSSS KHSTOTSDAD 5«0 

SyOARNQE GUGFXVKDRX SOTU^ERKR TGMMH^OO msxjnCTPS TTYEGRADHC C40 

HOFOW ^l^Trp knqlLAOTGH KPLMXKVFDV YLCFXQKHQS ^GE-rRTOQ SLSIINNCAK 720 

5SK££ SouStu* RNHn^Toia; smmflvi "v^^ ?S5££S vt^odi^aa boo 

" lfXI1 * KC ^^mfii™ IKTVLMATAQ MKEKERDPEM LVdXJYSIAK SYASTPnW ^^T^ kryeLIADTT BBQ 

£S£ SSS^ SSS£™ tfhxbeeash S£££5 ,co 

SrrS^K Pl^EISORlA, ,SS«OHHr 104O 
I 10 I 20 ' 



1 AATTGTAATA 
81 GAATTCGGCA 
161 ATGAAAAGCA 
241 GTCGTTGAAA 
321 GGTCTCGGCG 
4 01 GGGTAGATGG 
4 81 ATTTTTTCCA 
561 CATGCGATGG 
641 ACAGGAAGAA 
721 ACTTGAGGTC 
801 ACCAAATCCA 
881 CTTTGATGTA 
961 CTGCATCCTA 
1041 GAGGCATCTA 
1121 CAAGCAGATC 
1201 GTCTAGTGTG 
1281 GACCTCCAGC 
1361 GACAGCCCTC 
1441 ATGACAGATA 
1S21 GTCCAGCGGA 
1601 ACCAGCTGTG 
1681 TCTCCGGCAT 
1761 ATAAGCACAG 
1841 CACATTGGGA 
1921 TCTTAAAGAG 
2001 ATATCTGAAG 
2081 AGTTCATGAT 
2161 GCAGCCTGGA 
2341 GCCAACATTG 
2321 GGCCGACCAT 
2401 CGGCTTTAAA 
2481 ATGTOTGCGG 
2561 GCTCTACTTC 
2641 CTGTCAGCCA 
2721 GCCAACAGTG 
2801 AATGGCCACC 
2881 ATGCCAGCAC 
2S61 GCAGCAATGT 
3041 CGCCTTCAGG 
3121 ACGAGGATCT 
3201 ATCTACAAAC 
3281 GOAACCCAAA 
3361 TCAAAATGAT 
3441 ATCCCCTTCT 
3S2i TGAGATGCCA 
3601 TACACIGCTT 
3681 ATTGACGAGA 
3761 CAAACTCCAG 
3841 CAAAGCGATA 
3921 GCGGTAAACG 
4001 GGAGCTTTCT 
4081 TCAACGCCAT 
4161 ATGGCCCGTG 
4241 CAGGGAGGAC 
4321 AGAAGGTGCA 
4401 TCTTGAGCTG 
4481 ATCTACATGG 
•4 561 AATCAGGGTA 
4 641 ATAGAGCAAG 
•4721 TCATCGACTC 
4 801 TGTGATAAAT 
4 861 TGTATACAAG 



CGACTCACTA 
CGAGTTTTAC 
CCACCTGTTG 
CCCAAGTTGG 
AACCTTCCTT 
AGGCAAGCCA 
GTACTGTCAG 
AAGGCCACGT 
GTCGCGGTTA 
ATATGTTAAG 
TGACCACGAT 
CTGATCAAAT 
TCATCATGCA 
AGAACGCGAA 
AACAACTACA 
CAACCATGAA 
TTGACTACTC 
CAGGAGTTCC 
TGCTTCAAGG 
TCAATGTGAG 
AATCCGCTGG 



I 



30 



ATTCGGGTAA 
AATTCCGTGG 
C A TGTCTGAT 
TCTGCCTGCA 
CGAAAtTTCTC 
TAACTCTCTC 



CTGATGAGGA 
GCTGATAGCA 
ACCGGCTTAT 
GCCCAGATGA 
GCCCGAGCTC 
GCTATGTCCA 
GTCATTACCC 
GCTGATGGAG 
TTATCATCCC 
CTCACACCGC 
ACAGGATTCT 
TTGACGAAAA 
TTTACGCAGA 
CCCTTATGTG 
TGAGTAAGAA 
GGCAGCGTGA 
TCCTGACAAT 
AACGTCTGAT 
GAAATCATGC 
CAGTGGGACT 
TGTGGGGACT 
CAAGGGGAAG 
CATATTTTTT 
GACTTAGATT 
CAGGGAGGGC 
CAGAACTTAC 
AATAGTGAGC 
CGGGACGGTC 
TTGTGCTOGT 



TAGGGCGAAT 
ACCATCACCA 
CTCACATTCT 
CTACTCCTGG 
CGGGCTATCT 
CTGCTGAAAA 
AAAACCGAAT 
GATGATCGCC 
ACGTGACTCG 
TACGCGTATA 
TCTCAAGCCT 
CTATGGCTCA 
GCGGAAACCG 
TCATAGCCTT 
TTAGCTC7TTT 
CATTATATTC 
ATTAACAGAT 
GGGAGGTCCG 
AGCCATCAGG 
GGATGTGTCA 
TGACGCCGCA 
TATACAACCT 
CAGCCTTCCA 
TTCGCTGTGA 
GATGCTTTGT 
CCAGTTCCAG 
A GACAT TGCC 
ACTTTTAACC 
TTGCCTGACA 
CTCTCATGAA 
ACTGCCTTAA 
CGAGATTCTC 
ACAACTTTGA 
GACGTTGTTG 
TAAGCACACC 
AQGAGCATGA 
AGGAAGACGT 
CGTAACAGCC 
CAAACATCGA 
CTCCTTGAGC 
CATTTATGAG 
TGTCGGAAAT 
GGCAAGGTCA 
AGAGTTGCAA 
CCGGGAAGAG 
AAGAAGCGCA 
GGTGGCGGAG 
GTGTTCAGGT 
AAAGTGAAGC 
TAAAGAAGAC 
ATGAGCAGAT 
CCAACAAGCA 
TGCTTTCTCA 
GGGAGAGAAA 
TAAATCTCAC 
TTATTCTTCC 
TGCACTGACA 
TAGTTTTGTC 
TAACTGAGCT 
ATATATGTAT 
CCAGTATATG 



TGGGTACCGG 
AAACCCAGAA 
TCCATGTCAG 
CTTCCCCTCC 
TGGCTACCAA 
TTTCCACTCA 
CTGGAGCCCA 
TTCTTGCGCA 
GGTCATTATT 
AGGCTCAGCC 
TCTGCCGATT 
GCATTTGATA 
TTGTAAATAT 
GCTGTCTTCA 
TGCTCCTGGA 
CGTTGAACTT 
GAGTTCTGCA 
TCTGATCGCC 
CAAGGATAGC 
CCCTTCCCIG 
GAAGGGAAGC 
CAACTCCAAA 
GAAAGGAATA 
TAAACTTGAC 
TTACATATTG 
TACATGGGGA 
TGTTTCCCGT 
ACAGCTATGG 
GCTCTGGACA 
AAAAGTTTTT 
GGTCCTTAAT 
AAGTGCTGTA 
TTACACTGGA 
GCATTGGGGA 
AGCTTCTCCT 
GAACGACCCA 
GGCTCGACAG 
CTAGTGGCAG 
CGAGGAGGCC 
AGTGCGCAGA 
AAGCGGAGGG 
TTCTCAGAGA 
ACCCTAAGGA 
GAAAGGAAAA 
GCAGGGCGGO 
TCCCTGTCAT 
CTCCGGCAGC 
CAATGCTGGC 
TGCTTAAGGA 
CAGCTCGAGT 
CTGCCCCCTG 
CAATGGTTCA 
TTTGCAAACT 
GGAAATAAAG 
TGGCAATATT 
TTGCAGAGTA 
TTGATGCCTG 
TAGGAGTATG 
AGACACTCAA 
TACATTTCTA 
CAATACACTT 



I 



50 

GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGGTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTG 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TGAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGOGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
-AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTOG 
CATQGCCAQG 
AATATCTCAC 
TCCATGATOG 
TGGACTCTOG 
ATTTCTTTGA 
CTCCTTAAAC 
TCTGGATTCT 
CAGAGTTTGA 
GTGGAAGAGC 
GTACCAGCAC 
TGTGCTCCTC 
C CACTA GCAT 
AGTTTTCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATGACC 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTTC 
GTGTTAGAAT 
GGGGACCTTT 
TTGTATX3ACT 
TTAATCCGCT 
CATTTTTAAT 
TAATGGTTTA 



I 



60 

GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCTCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 
CTGCCTCTGT 
CATGACCGTG 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATCTG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATCTTC 
ATTTACATTG 
T G TCTT T T CT 
CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTCGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTG 
ATGCGCGAGC 
CAATTTGTGG 
AATGAAAGCC 
CGAGCGTCTT 
AGCTCGTCTT 
TCCAAAGCCA 
TTTCTTAACA 
ATTGTGTCTT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 
TTCTTGTCAT 



I 70 
GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATQGTCCG 
TCAGGATCAG 
TAAAGTACCT 
CGAGTCCTCA 
TGAGGAAGGA 
AGACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTGTTACTGA 
GCTGATAAAG 
TTGGTCTGCT 
AAGGATGAAT 
CAAGGACCTC 
CTGATTCGAG 
GATAAGCACC 
ACTGATCTGT 
AACTTATOGA 
CAGGAGGQGT 
TGCCAGATTG 
TGCACCAGTC 
GCGTTTAAGA 
TCAAAAACAT 
TCTATGAAGG 
AGGACGGAGG 
ACATTTGCAA 
TGTCCATCAT 
AAAAGGATAC 
GTACAGCCTG 
AAAATGGCGA 
GTGTTTAGAC 
GATGCAGGAT 
GCTAOGAGCT 



I 



20 



SO 



60 



TAAATTTGGT 
ACATCCAOGT 
AACATCCOCC 
GCGCACCATC 
TGAACCCCAT 
GACATGATCA 
TTTCTTAGAT 
AAGCTTGOGG 
AACTACAGGG 
ACCGAATTCC 
CGGTCGTGTG 
ATCACTOGGG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GCTTATGCAT 
AAAAATGTGC 

70 



I 80 
GCTTCATATC 
ACTCAGCTGC 
GAAGAGGGAT 
AGCACATCCC 
GAAATTAAAT 
CATTTACATA 
TAAGAGTCTG 
CCAGAGCCAC 
TTGGAGAGCC 
TGAAGAACTG 
CATGGTTTTT 
CAGAGATTTC 
AGATAATCXIA 
GCTTTCJTCTT 
GAATTTCTCC 
AAGATACCAA 
GGGAGGTOGG 
CATTCTTTTG 
GATTGAAAAC 
CCCTGGCTCT 
CTGGGCGCCA 
AGGATCTCTC 
AACAAAGTAG 
TTCCTCTACA 
TTT ITTTACA 
TGGGACCCAT 
CAGCAGCTGG 
ATTACTTGAA 
ACCAGCTCCT 
CAGTCTGAAA 
GAGAOCGGAC 
CCTCCCAGCT 
GTCATCATAT 
CAACAACTGT 
GCACGCTGCT 
GCCAAATCCT 

TcrercACAG 

AAGGATGCAC 
GTCCATTTCA 
CATOGCCOAC 
ATATTTACAA 
TCTGAAAATG 
GACTCACGTC 
G CTTCATGTT 
CTGACAGCCA 
CGAGGTGGCC 
AACTGCAGCT 
GATACAAACA 
TCAAGCCTTA 
AAATGGCGAA 
CTTCACATCT 
ATTACATCTC 
AGACOGAGCA 
AQGAGTTGTA 
TSGTAGACAC 
AGGTTCTGQG 
ATCTGATCGT 
ATTCAACAAC 
TTG TCAT TAA 
TAAGTTTAAT 
AATATGGAGA 



I 
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320 
400 
480 
SGO 
640 
720 
8O0 
680 
9€0 
1O40 
1120 
1200 
12S0 
1360 
14 4 0 
1S20 
1600 
1680 
1760 
1640 
1920 
2O00 
2080 
2160 
2240 
2320 
2400 
2460 
2S60 
2640 
2720 
2800 
2880 
2960 
3040 
3X20 
3200 
3280 
3360 
3440 
3520 
3600 
3680 

3 760 
3840 
3920 

4 000 
4080 
4160 
4240 
4320 
4400 
4480 
4560 
4640 
4720 
4800 
4880 
4898 



Flfl. IDd, 



I 1° 

1 KEGHVMIAFL, 
81 SMTTILKPSA 
161 SKNANHSIAV 
241 QLDYSLTDEF 

3 21 RINVRDVSPF 

4 01 TDSGNSLPER 
4 81 EVCLttQFQYM 
561 IATEVCLTAL 
641 AALXTYEILXC 
721 SDRLIKHTSF 
801 MCYVHVTAL.V 
881 KLIIPIYEKR 
9 61 FFDE KEI/QER 

1041 EMSKKVAEUt 
1121 HERL.I1CEDQ1. 

I « 



| 20 
PTILNQLJ'RV 
DFLTSNKLLR 
FIKRCFTFMD 
CRNKFLVGLi 
FVNAGWTVKD 
NSEKSNSLDK 
GKRYIARNQE 
DTI^SLFTUAF 
CNSK1*SSIRT 
SSDVKDLTKR 
AEYLTRKGVF 



rTEFERSKNI 
QLCSSAEVDM 
EYQEEMKANY 



30 

LTRATQEEVA 
YSWFFFDVLI 
RGFVFKQINN 
LREVGTAIOE 
ESiAliPAVNP 
HQQSSTLGNS 
GLGPIVHDRK 
KNQLLADHGH 
EASQLAYFXM 
IRTVLMATAQ 
RQGCTAFRVI 
EYIYKEPKLT 
RRFMFEMPFT 
IKLQIJCIiQGS 



30 



I «0 

VNVTRVIIHV 
KSMAQKLIEN 
YISCFAPGDP 
FREVRLIAIS 
LVTPQKGSTL 
WRCDKLDOS 
SQTLPVSRNR 
NPLMIOCVFDV 
RNNFDYTGKK 
MKEHENDPEM 
TPNIDEEASM 
PLSEISQRLL 
QTGKRQGGVE 
VSVOVHAGPL 
MHEQICPLEE 
| 40 



I 50 
VAQCHEEGUE 
SKVXLLRNQR 
CTUEYWEF 
VIJOILLIKHS 
DNSLHKDLLG 
EIKSLLMCFL 
TGMMHARLQQ 
YLCFLQKHQS 
SFVRTHLQVI 
LVDLQYSLAK 
MEDVGMQDVH 
KLYSDKFGSE 
EOCKRRTILT 
AYARAFLDDT 
FTSVUPNSLH 
I 50 



I 



60 

SKLRSYVKYA 
FPASYHHAAE 
LRWCNHEHY 
FDDRYASRSH 
AISG1ASPYT 
YILKSMSDDA 
LiGSXjDNSLTF 
ETALXNVFTA 
ISVSQLIADV 
SYASTPELRK 
FHEDVt>ELL 
NVKMIQDSGK 
AIHCFPYVXK 
NTKRYPEWKV 
IFNAISGTPT 



I 



60 



TO 

YKAEFYVASE 
TWNMLMPHI 
IPLNLPMPFG 
QARXATLYLP 
TSTPNIMSVR 
UFTYWNKAST 
HHSYGHSDAD 
1JISLIYKFPS 
VGIGETRFQQ 
TWUDSMARIH 
EQCADGUWKA 
VNPKDUDSKY 
RIFVMYQHHT 
IQJJCEVFRQF 
STMVHGMTSS. 



YTTVHEEliTK 80 



KGRIQRYQDLi 
LFGT ,T ,T ENVQ 



SELMDFFT1S 
VLHQSLXEAN 



SL^IINNCAN 



ERYELIADIY 
AYTQVTHVIP 
DLNPIEVAXD 
VEACGQA1AV 
SSW 



I 



70 



I 



80 



160 

240 

320 

400 

4BO 

560 

64 O 

720 

BOO 

8SO 

960 

1040 

1120 

1194 



flf|. IDdi^ttwh) 



I 10 
1 AATTCTAATA 
81 GAATTCGGCA 
161 ATGAAAAGCA 
241 CTCCTTGAAA 
321 GGTCTCGGCG 
101 GGGTAGATGG 
'81 ATTTTTTCCA 
561 CATGCGATGG 
641 ACAGGAAGAA 
721 ACTTGAGGTC 
B01 ACCAAATCCA 
881 CTTTGATGTA 
961 CTGCATCCTA 
1041 GAGGCATCTA 
1121 CAAGCAGATC 
1201 GTGTAGTGTG 
1281 GACCTCCAGC 
1361 GACAGCCCTC 
1441 ATGACAGATA 
1521 GTCCAGCGGA 
1601 ACCAGCTGTG 
1681 TCTCCGGCAT 
1761 ATAAGCACAG 
1B41 CACATTGGGA 
1921 TCTTAAAGAG 
2001 ATATCTGAAG 
2081 AGTTCATGAT 
2161 GCAGCCTGGA 
2241 GCCAACATTG 
2321 GGCCGACCAT 
2401 CGGCTTTAAA 
24 81 ATGTGTGCGG 
2S61 GCTCTACrrC 
2641 CTGTCAGCCA 
2 721 GCCAACAGTG 
2801 AATGGCCACC 
2 881 ATGCCAGCAC 

2 961 GCAGCAATGT 

3 041 "CGCCTTCAjGG 
3121 ACGAGGATGT 
3201 ATCTACAAAC 
3281 GGAACCCAAA 
3361 TCAAAATGAT 
3441 ATCCCCTTCT 
3521 TGAGATGCCA 
3 601 TACACTGCTT 
3 681 ATTGACGAGA 

3 761 CAAACTCCAG 
3841 CAAAGCGATA 
3921 GCGGTAAACG 
4001 GGAGCTTTCT 

4 001 TCAACGCCAT 
4161 ATGGCCCGTC 
4241 CAGGGAGGAC 
4321 AGAAGGTGCA 
4401 TCTTGAGCTG 
4 4B1 ATCTACATGG 
4S61 AATCAGGGTA 
4 641 ATAGAGCAAG 
4 721 TCATCGACTC 
4 801 TGTCATAAAT 
4 881 TGTATACAAG 

I 10 
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CGA CTCACT A TAGGGCGAAT TGGGTACCGG 
CGAGTTTTAC ACCATCACCA AAACCCAGAA 
CCACCTGTTG CTCACATTCT TCCATGTCAG 
CCC AAGT TGG CTACTCCTC5G CTTCCCCTCC 
AACCTTCCTT CGGGCTATCT TGGCTACCAA 
AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
AAGGCCACGT GATGATCGCC TTCTTGCCCA 
GTCGOGGTTA ACGTGACTCG GGTCATTATT 
ATATGTTAAG TACGCGTATA AGGCTCAGCC 
TGACCACGAT TCTCAAGCCT TCTGCCGATT 
CTGATCAAAT CTATGGCTCA GCATTTGATA 
TCATCATGCA GCGGAAACCG TTOTAAATAT 
AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
AACAACTACA TTAGCTGTTT TGCTXXTGGA 
CAACCATGAA CATTATATTC CCTTGAACTT 
TTGACTACTC ATTAACAGAT GAGTTCTGCA 
CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
AA TCCGCT GG TGACGCCGCA GAAGGGAAGC 
TGCTTCTCCA TATACAACCT CAACTCCAAA 
ATTCC3GGTAA CAGCCTTCCA GAAAGGAATA 
AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
CATGTCTCAT GATGCTTTGT TTACATATTG 
TCTGCCTGCA CCAGTTCCAG TACATCGGGA 
CGAAAGTCTC A GACAT TGCC TGTTTCCCGT 
TAACTCTCTC ACTTTTAACC ACAGCTATGG 
CTACTGAGGT TTGCCTGACA GCTCTGGACA 
GG ACATAATC CTCTCATGAA AAAAGTTTTT 
AAAIGiCX-lX . ACTGCCTTAA GGTCCTTAAT 
CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
CTGATGAGGA ACAACTTTGA TTACACTGGA 
GCTGATAGCA GACGTTGTTG GCATTGGGGA 
ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
GCCCAGATGA AQGAOCATGA GAACGACCCA 
GCCCGAGCTC AGGAAGACCT GGCTCGACAG 
GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
GTCATTACCC CAAACATCGA CGAGGAGGCC 
GCTGATGGAG CTCCTTGAGC AGTCCGCAGA 
TTATCATCCC CATTTATGAG AAGCGGAGGG 
CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
TTCACGAAAA AGAGTTGCAA GAAAGGAAAA 
TTTAC GCAGA CCGGGAAGAG GCAGGGCGGG 
CCCTTATGTG AAGAAGOGCA TCCCTGTCAT 
TTSAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
i C CTCACA AT AAAGTGAAGC TGCTTAAGGA 
AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
CAGTGGGACT CCAACAAGCA CAATGGTTCA 
TGTQGGGACT TGCTTTGTCA TTTGCAAACT 
CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
CATATTTTTT TAAATCTCAC TGGCAATATT 
GACTTAGATT TTATTCTTCC TTGCAGAGTA 
CAGGGAGGGC TGCACTGACA TTGATGCCTT3 
CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
AATAGTGAGC TAACTGAGCT AGACACTCAA 
CGGGACGGTC ATATATGTAT TACATTTCTA 
TTGTCCTCGT CCAGTATATG CAATACACTT 
TCTTTACT 

I 20 I 30 I 40 



GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGGTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTG 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
T^GAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
^AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTGG 
CATGGCCAGG 
AATATCTCAC 
TCCATGATGG 
TGGACTCTGG 
ATTTCTTTGA 
CTCCTTAAAC 
TCTGGATTCT 
CAGAGTrTGA 
GTGGAAGAGC 
GTACCAGCAC 
TCTGCTCCTC 
CCACTAGCAT 
AGTTTTCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATGACG 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTTC 
GTGTTAGAAT 
GGGGACCTTT 
TTGTATGACT 
TTAATCCGCT 
CATTTTTAAT 
TAATGGTTTA 



| 60 
GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCTCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 
CTGCCTCTGT 
CATGACCGTG 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATCTG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 

CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 

AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTCGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTG 
ATGCGCGAGC 
CAATTTGTGG 
AATSAAAGCC 
CGAGCGTCTT 
AGCTCGTCTT 
TCCAAAGCCA 
TTTCTTAACA 
ATTGTCTCTT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 



I 70 
GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCCG 
TCAGGATCAG 
TAAAGTACCT 
CGAGTCCTCA 
TGAGGAAGGA 
AGACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTGTTAC IViA 
GCTGATAAAG 
TTGGTCTOCT 
AAGGATGAAT 
CAAGGACCTC 
CTGATTCGAG 
GATAAGCACC 
ACTGATGTGT 
AACTTATOGA 
CAGGAGQGGT 
TGCCAGATTG 
T GCACC AGTC 
GCGTTTAAGA 
TCAAAAACAT 
TCTATGAAGG 
AGGACGGAGG 
ACATTTBCAA 
TGTCCATCAT 
AAAAGGATAC 
GTACAGCCTG 
AAAATGGCGA 
GTGTTTAGAC 
GATGCAQGAX 
GCTACGAGCT 
GGAAAQGAGT 
TAAATTTGGT 
ACATCCAGGT 
AACATCCQCC 
GCGCACCATC 
TGAACCCCAT 
GACATGATCA 
TTTCrrAGAT 
AAGCTTGOGG 
AACTACAGGG 
ACCGAATTCC 
CGGTOGTGTC 
ATCACTSGGG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GCTTATGCAT 
AAAAATGTGC 



I 



50 



I 



70 



I 80 
GCTTGATATC 80 
ACTCAGCTGC 160 
GAAGAGGGAT 24 0 
AGCACATCCC 320 
GAAATTAAAT 4 00 
CATTTACATA 4 80 
TAAGAGTCTG S60 
CCAGAGCCAC 64 0 
TTGGAGAGCC 720 
TCAAGAACTG 800 
CATGGTTTTT 8B0 
CAGAGATTTC 960 
AGATAATCCA 104 0 
GCTTTGTCTT 1120 
GAATTTCTCC 12 OO 
AAGATACCAA 12 8 O 
GGGAGGTGQG 1360 
CATTCTTTTG 14 4 0 
GATTGAAAAC 152 0 
CCCTGGCTCT 1600 
CTOGGCGCCA 1680 
AGGATCTCTC 1760 
AACAAAGTAG 1840 
3TCCTCTA CA 1920 
TTTTTTTACA 2000 
TGQGACCCAT 2080 
CAGCAGCTOG 21 60 
ATTACTTGAA 224 0 
ACCAGCTCCT 2320 
CAGTCTGAAA 2400 
GAGAGCGGAC 2480 
CCTCCCAGCT 2560 
GTCATCATAT 264 0 
CAACAACTGT 2720 
GCACGGTGCT 2800 
GCCAAAT CCT 2 8 80 
TCTCTCAGAG 2960 
AAGGATGCAC 3040 
GTCCATTTCA 3X20 
CATOGCOGAC 3200 
ATATTTACAA 3280 
TCTGAAAATG 3360 
GACTCAOGTC 3440 
GCTTCATGTT 3520 
CTGACAGCCA 3600 
•CGAGGTGGCC 3680 
AACTGCAGCT 3760 
GATACAAACA 384 0 
TCAAGCCTTA. 3920 
AAATQGCGAA 4000 
CTTCACATCT 4 080 
ATTACATCTC 4160 
AGACCGAGCA 4240 
AGGAGTTGTA 4320 
TGGTAGACAC 4400 
AOGiTCTOGG 4480 
ATCTGATCGT 4S60 
ATTCAACAAC 4640 
TTGTCATTAA 4720 
TAAGTTTAAT 4800 
AATATCGAGA 4 880 
4896 

| 80 



I 10 I 20 | 30 | 40 

1 KEGHVMIAFL PTIUNQLfRV LTRATQEEVA VNVTRVIIHV 
81 SMTTILXPSA DFLTSNKX.LR YSWFFFDVLI KSMAOHLIEN 
161 EKNANHS1AV FIKRCFTFMD RGFVFKQIHN YISCFAPGDP 
241 QLDYSLTDEF CRNHFLVGLL, UtEVGTAljQE FREVRLIAIS 
321 RINVRtJVSPF PVNAGMTV1CD ESLALPAVNP LVTPQKGSTL 
4 01 TDSGNSLPER NSEICSWSIJJX HQQSSTI/3NS WRCDKLDQS 
4 81 EVCU1QFQYM GKRYXARNQE GUSPIVHDRK SQTLPVSRNR 
561 XATEVCLTAL DTISUTXAF KNQULADHGH NPLMKKVFDV 
641 AALCYEILKC CNSKI^SIRT EASQLLYFL*! RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR IRTVLMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYL.TRKGVF RQGCTAFRVI TPNIDEEASM 
8 81 KLIIPIYEKR RDFFEDEDGK EY1* 1CEPKLT PLSEISQRLL 
561 FFDEJCELQER KTEFERSHHI RRFMFEMPFT QTGKRQGGVE 
1041 EMSIOWAELR QLCSSAEVDM IKLQUCU3GS VSVQVKAGPL 
1121 NERLIKEDQL EYQEEMKANY REMAKELSEI MHEQICPUEE 
| 10 | 20 | 30 | 40 



| SO | 60 I 70 | 80 

VAQCHEEGLE SHLRSYVKYA YXAEPYVASE YKTVHEELTK BO 
SXVXLLRNQR FPASYHHAAE TWMMLMPHI TQKFGDNPEA 160 
KTLFEYXFEF LRWCNHEHY IPUfLPKPFG KGRIQRYQDL. 24 0 
VLJCKLLIKHS FDDRYASRStt QARIATLYLP LFGLLIENVQ 320 
DNSLHRDLXG AISGIASPYT TSTPKINSVR KADSRGSUIS 4 00 
EIXSLLMCFL YILXSMSDDA LFTYHNXAST EELMDFFTIS 4 80 
TGMKHARI/QQ LGSLDNSLTF MHSYGHSDAD VLHQSLiEAN 560 
YLCFLQKHQS ETALKNVTTA LRSUIYKFPS TFYEGRADMC 64 0 
SFVRTHLCVI ISVSQL1ADV VGZGETRFQQ SLSirNNCAN 720 
LVDLQYSLAX SYASTPEIJIK TWUJSMARXH VTCNGDLSEAA BOO 
MEDVGMQDVH FNEDVLMELL EQCADGLWKA ERYELXADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDLDSKY AYIOVTHVIP 960 
EQCKRRTT1.T AIHCFFYVKK RIFVMYQHHT DLNPIEVAID 104 0 
AYARAFLDET NTKRYPDNKV KUJOEVFRQF VEACGQAXAV 1120 
ICTSVLPNSU1 IFNAISGTPT STMVHGMTSE SSW 1194 
| 50 (60 | 70 | 80 



Exon 1A {-182 to -102) 

GCAGGGGAAAAACCTGGCCCCATGATTCACTTACTTCCCACCGGATCTCTCCCATGACACGTGAGGATTA 
TTACAATTTAA -102 

Exon IB (-219 to -102) 

TTATCCCTTTACTACTTGCGAAGTGAGTTCGGTAGATGGGAGTGGAGAAGAGAACCTTAGAATCATTGTTTAGTCTTCAT 
CTTTCACAGCTCAGGCTGAAGGCCTTTCCTTGCTGAGA -102 

Exon 1C (-143 to -102) 

GCGGCAGAGCGTGTCTGAGGTGGTGCGCGGCTCCGTGCTCCT -102 



Exon2 and the rest of human CLASP2 cDNA 

-101 -79 
GGCAAAGCCAAAGCTAATTGAGC 

-78 "I 
AAGCTAATTGAGCCACTCGACTATGAAAATGTCATCGTCCAGAAGAAGACTCAGATCCTGAACGACTGTTTACGGGAG 

1/1 31/11 

ATG CTG CTC TTC CCT TAC GAT GAC TTT CAG ACG GCC ATC CTG AGA CGA CAG GGT CGA TAC 
Met leu leu phe pro tyr. asp asp phe gin thr ala ile leu arg arg gin gly arg tyr 

61/21 91/31 

ATA TGC TCA ACA GTG CCT GCG AAG GCG GAA GAG GAA GCA CAG AGC TTG TTT GTT ACA GAG 
ile cys ser thr val pro ala lys ala glu glu glu ala gin ser leu phe val thr glu 

121/41 151/51 

TGC ATC AAA ACC TAT AAC TCT GAC TGG CAT CTT GTG AAC TAT AAA TAT GAA GAT TAC TCA 
cys ile lys thr tyr asn ser asp trp his leu val asn tyr lys tyr glu asp tyr ser 

181/61 211/71 

GGA GAG TTT CGA CAG CTT CCG AAC AAA GTG GTC AAG TTG GAT AAA CTT CCA GTT CAT GTC 
gly glu phe arg gin leu pro asn lys val val lys leu asp lys leu pro val his val 

241/81 271/91 

TAT GAA GTT GAC GAG GAG GTC GAC AAA GAT GAG GAT GCT GCC TCC CTT GGT TCC CAG AAG 
tyr glu val asp glu glu val asp lys asp glu asp ala ala ser leu gly ser gin lys 

301/101 331/111 

GGT GGG ATC ACC AAG CAT GGC TGG CTG TAC AAA GGC AAC ATG AAC AGT GCC ATC AGC GTG 
gly gly ile thr lys his gly trp leu tyr lys gly asn met asn ser ala ile ser val 

361/121 391/131- 

ACC ATG AGG TCA TTT AAG AGA CGA TTT TTC CAC CTG ATT CAA CTT GGC GAT GGA TCC TAT 
thr met arg ser phe lys arg arg phe phe his leu ile gin leu gly asp gly ser tyr 

421/141 451/151 

AAT TTG AAT TTT TAT AAA GAT GAA AAG ATC TCC AAA GAA CCA AAA GGA TCA ATA TTT CTG 
asn leu asn phe tyr lys asp glu lys ile ser lys glu pro lys gly ser ile phe leu 

481/161 511/171 

GAT TCC TGT ATG GGT GTC GTT CAG AAC AAC AAA GTC AGG CGT TTT GCT TTT GAG CTC AAG 
asp ser cys met gly val val gin asn asn lys val arg arg phe ala phe glu leu lys 
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541/181 

ATG CAG GAC AAA AGT AGT TAT CTC TTG GCA 

met gin asp lys ser ser tyr leu leu ala 

601/201 

ATC ACA ATT CTA AAT AAG ATC CTC CAG CTC 

ile thr ile leu asn lys ile leu gin leu 

661/221 

AAT GGC GAC TCT CAC GAA GAT GAT GAA CAA 

asn gly asp ser his glu asp asp glu gin 

721/241 

GAT AGC TAC CTG CCG GAA CTT GCC AAG AGT 

asp ser tyr leu pro glu leu ala lys ser 

781/261 

GAA AGC AGA GTC AAA CTT TTT TAT TTG GAC 

glu ser arg val lys leu phe tyr leu asp 

841/281 

GCT GAG CCA GAA GTG AAG TCA TTT GAA GAG 

ala glu pro glu val lys ser phe glu glu 

901/301 

AAT GAT TTA TCT TTC AAT TTG CAA TGC TGT 

asn asp leu ser phe asn leu gin cys cys 

961/321 

AAT GTT GAA CCT TTC TTT GTT ACT CTA TCC 

asn val glu pro phe phe val thr leu ser 

1021/341 

TCT GCC GAT TTC CAC GTA GAC CTG AAC CAT 

ser ala asp phe his val asp leu asn his 

1081/361 

TCC CCG GCG CTG ATG AAT GGC AGT GGG CAG 

ser pro ala leu met asn gly ser gly gin 

1141/381 

GAA GCC GCC ATG CAG TAT CCG AAG CAG GGA 

glu ala ala met gin tyr pro lys gin gly 

1201/401 

ATA TTT CTT GTG GCC AGA ATT" GAA AAA GTC 

ile phe leu val ala arg ile glu lys val 

1261/421 

CCA TAT ATG AAA AGT TCA GAC TCT TCT AAG 

pro tyr met lys ser ser asp ser ser lys 

1321/441 

CAG GCA TGC CAA AGA CTA GGA CAG TAT AGA 

gin ala cys gin arg leu gly gin tyr arg 



571/191 

GCA GAC AGT GAA GTG GAA ATG GAA GAA TGG 
ala asp ser glu val glu met glu glu trp 

631/211 

AAC TTT GAA GCT GCA ATG CAA GAA AAG CGA 
asn phe glu ala ala met gin glu lys arg 

691/231 

AGC AAA TTG GAA GGT TCT GGT TCC GGT TTA 
ser lys leu glu gly ser gly ser gly leu 

751/251 

GCA AGA GAA GCA GAA ATC AAA CTA AAA AGT 
ala arg glu ala glu ile lys leu lys ser 

811/271 

CCA GAT GCC CAG AAG CTT GAC TTC TCA TCA 
pro asp ala gin lys leu asp phe ser ser 

871/291 

AAG TTT GGA AAA AGG ATC CTT GTC AAG TGC 
lys phe gly lys arg ile leu val lys cys 

931/311 

GTT GCC GAA AAT GAA GAA GGA CCC ACT ACA 
val ala glu asn glu glu gly pro thr thr 

991/331 

CTG TTT GAC ATA AAA TAC AAC CGG AAG ATT 
leu phe asp ile lys tyr asn arg lys ile 

1051/351 

TTC TCA GTG AGG CAA ATG CTC GCC ACC . ACG 
phe ser val arg gin met leu ala thr thr 

1111/371 

AGC CCA TCT GTC CTC AAG GGC ATC CTT CAT 
ser pro ser val leu lys gly ile leu his 

.1171/391 

ATA TTT TCA GTC ACT TGT CCT CAT CCA GAT 
ile phe ser val thr cys pro his pro asp 

1231/411 

CTT CAG GGG AGC ATC ACA CAT TGC GCT GAG 
leu gin gly ser ile thr his cys ala glu 

1291/431 

GTG GCC CAG AAG GTG CTG AAG AAT GCC AAG 
val ala gin lys val leu lys asn ala lys 

1351/451 

ATG CCA TTT GCT TGG GCA GCA AGG ACA TTG 
met pro phe ala trp ala ala arg thr leu 
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1381/461 

TTT AAG GAT GCA TCT GGA AAT CTT GAC AAA 
phe lys asp ala ser gly asn leu asp lys 

1441/481 

GAC AGC AAT AAG CTA TCC AAT GAT GAC ATG 
asp ser asn lys leu ser asn asp asp met 

1501/501 

GAG AAG ATG GCT AAG CTC CCA GTG ATT TTA 
glu lys met ala lys leu pro val ile leu 

1561/521 

TCC TCA GAC TTC CCT AAT TAT GTT AAT TCA 
ser ser asp phe pro asn tyr val asn ser 

1621/541 

TGC AGT AAA ACT CCC ATC ACG TTT GAA GTG 
cys ser lys thr pro ile thr phe glu val 

1681/561 

ACT CAG CCT TAC ACC ATC TAC ACC AAT CAC 
thr gin pro tyr thr ile tyr thr asn his 

1741/581 

GAC AGT CAG AAG TCT TTT GCC AAG GCT AGA 
asp ser gin lys ser phe ala lys ala arg 

1801/601 

TCA GAT GAG GAA GAC TCT CAG CCC CTT AAG 
ser asp glu glu asp ser gin pro leu lys 

1861/621 

TTC ACA AGA AGC GCC TTT GCT GCA GTT TTA 
phe thr arg ser ala phe ala ala val leu 

1921/641 

GAG ATT AAA ATA GAG TTG CCC ACT CAG CTG 
glu ile lys ile glu leu pro thr gin leu 

1981/661 

TTC CAT GTC AGC TGT GAC AAC TCA AGT AAA 
phe his val ser cys asp asn ser ser lys 

2041/681 

ACC CAA GTT GGC TAC TCC TGG CTT CCC CTC 
thr gin val gly tyr ser trp leu pro leu 

2101/701 

CAG CAC ATC CCG GTC TCG GCG TAC CTT CCT 
gin his ile pro val ser ala tyr leu pro 

2161/721 

ATG GGC AGG CAT TAT GGT CCG GAA ATT AAA 
met gly arg his tyr gly pro glu ile lys 



1411/471 

AAT GCC AGA TTT TCT GCC ATC TAC AGG CAA 
asn ala arg phe ser ala ile tyr arg gin 

1471/491 

CTC AAG TTA CTT GCA GAC TTT CGG AAA CCT 
leu lys leu leu ala asp phe arg lys pro 

1531/511 

GGC AAT CTA GAC ATT ACA ATT GAT AAT GTT 
gly asn leu asp ile thr ile asp asn val 

1591/531 

TCA TAC ATT CCC ACA AAA CAA TTT GAA ACC 
ser tyr ile pro thr lys gin phe glu thr 

1651/551 

GAG GAA TTT GTG CCC TGC ATA CCA AAA CAC 
glu glu phe val pro cys ile pro lys his 

1711/571 

CTT TAC GTT TAT CCT AAG TAC TTG AAA TAC • 
leu tyr val tyr pro lys tyr leu lys tyr 

1771/591 

AAT ATT GCG ATT TGC ATT GAA TTC AAA GAT 
asn ile ala ile cys ile glu phe lys asp 

1831/611 

TGC ATT TAT GGC AGA CCT GGT GGG CCA GTT 
cys ile tyr gly arg pro gly gly pro val 

1891/631 

CAC CAT CAC CAA AAC CCA GAA TTT TAT GAT 
his his his gin asn pro glu phe tyr asp . 

1951/651 

CAT GAA AAG CAC CAC CTG TTG CTC ACA TTC 
his glu lys his his leu leu leu thr phe 

2011/671 

GGA AGC ACG AAG AAG AGG GAT GTC GTT GAA 
gly ser thr lys lys arg asp val val glu 

2071/691 

CTG AAA GAC GGA AGG GTG GTG ACA AGC GAG 
leu lys asp gly arg val val thr ser glu 

2131/711 

TCG GGC CAT CTT GGC TAC CAA GAG CTT GGG 
ser gly his leu gly tyr gin glu leu gly 

2191/731 

TGG GTA GAT GGA GGC AAG CCA CTG CTG AAA 
trp val asp gly gly lys pro leu leu lys 
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2221/741 

ATT TCC ACT CAT CTG GTT TCT ACA GTG TAT 
ile ser thr his leu val ser thr val tyr 

2281/761 

CAG TAC TGT CAG AAA ACC GAA TCT GGA GCC 
gin tyr cys gin lys thr glu ser gly ala 

2341/781 

CTT AAG AGT CTG CAT GCG ATG GAA GGC CAC 
leu lys ser leu his ala met glu gly his 

2401/801 

AAC CAG CTG TTC CGA GTC CTC ACC AGA GCC 
asn gin leu phe arg val leu thr arg ala 

2461/821 

CGG GTC ATT ATT CAT GTG GTT GCC CAG TGC 
arg val ile ile his val val ala gin cys 

2521/841 

TCA TAT GTT AAG TAC GCG TAT AAG GCT GAG 
ser tyr val lys tyr ala tyr lys ala glu 

2581/861 

CAT GAA GAA CTG ACC AAA TCC ATG ACC ACG 
his glu glu leu thr lys ser met thr thr 

2641/881 

AGC AAC AAA CTA CTG AGG TAC TCA TGG TTT 
ser asn lys leu leu arg tyr ser trp phe 

2701/901 

CAG CAT TTG ATA GAG AAC TCC AAA GTT AAG 
gin his leu ile glu asn ser lys val lys 

2761/921 

TAT CAT CAT GCA GCG GAA ACC GTT GTA AAT 
tyr his his ala ala glu thr val val asn 

2821/941 

GGA GAT AAT CCA GAG GCA TCT AAG AAC GCG 
gly asp asn pro glu ala ser lys asn ala 

2881/961 

TGT TTC ACC TTC ATG GAC AGG GGC TTT GTC 
cys phe thr phe met asp arg gly phe val 

2941/981 

TTT GCT CCT GGA GAC CCA AAG ACC CTC TTT 
phe ala pro gly asp pro lys thr leu phe 

3001/1001 

TGC. AAC CAT GAA CAT TAT ATT CCG TTG AAC 
cys asn his glu his tyr ile pro leu asn 



2251/751 

ACT CAG GAT CAG CAT TTA CAT AAT TTT TTC 

thr gin asp gin his leu his asn phe phe 

2311/771 

CAA GCC TTA GGA AAC GAA CTT GTA AAG TAC 

gin ala leu gly asn glu leu val lys tyr 

2371/791 

GTG ATG ATC GCC TTC TTG CCC ACT ATC CTA 

val met ile ala phe leu pro thr ile leu 

2431/811 

ACA CAG GAA GAA GTC GCG GTT AAC GTG ACT 

thr gin glu glu val ala val asn val thr 

2491/831 

CAT GAG GAA GGA TTG GAG AGC CAC TTG AGG 

his glu glu gly leu glu ser his leu arg 

2551/851 

CCA TAT GTT GCC TCT GAA TAC AAG ACA GTG 

pro tyr val ala ser glu tyr lys thr val 

2611/871 

ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC 

ile leu lys pro ser ala asp phe leu thr 

2671/891 

TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT 

phe phe asp val leu ile lys ser met ala 

2731/911 

TTG CTG CGA AAC CAG AGA TTT CCT GCA TCC 

leu leu arg asn gin arg phe pro ala ser 

2791/931 

ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT 

met leu met pro his ile thr gin lys phe 

2851/951 

AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA 

asn his ser leu ala val phe ile lys arg 

2911/971 

TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT 

phe lys gin ile asn asn tyr ile ser cys 

2971/991 

GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG 

glu tyr lys phe glu phe leu arg val val 

3031/1011 

TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT 

leu pro met pro phe gly lys gly arg ile 
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3061/1021 

CAA AGA TAC CAA GAC CTC CAG CTT GAC TAC 
gin arg tyr gin asp leu gin leu asp tyr 

3121/1041 

TTC TTG GTG GGA CTG TTA CTG AGG GAG GTG 
phe leu val gly leu leu leu arg glu val 

3181/1061 

CGT CTG ATC GCC ATC AGT GTG CTC AAG AAC 
arg leu ile ala ile ser val leu lys asn 

3241/1081 

TAT GCT TCA AGG AGC CAT CAG GCA AGG ATA 
tyr ala ser arg ser his gin ala arg ile 

3301/1101 

CTG ATT GAA AAC GTC CAG CGG ATC AAT GTG 
leu ile glu asn val gin arg ile asn val 

3361/1121 

GGC ATG ACC GTG AAG GAT GAA TCC CTG GCT 
gly met thr val lys asp glu ser leu ala 

3421/1141 

CAG AAG GGA AGC ACC CTG GAC AAC AGC CTG 
gin lys gly ser thr leu asp asn ser leu 

3481/1161 

ATT GCT TCT CCA TAT ACA ACC TCA ACT CCA 
ile ala ser pro tyr thr thr ser thr pro 

3541/1181 

AGA GGA TCT CTC ATA AGC ACA GAT TCG GGT 
arg gly ser leu ile ser thr asp ser gly 

3601/1201 

AGC AAT TCC CTG GAT AAG CAC CAA CAA AGT 
ser asn ser leu asp lys his gin gin ser 

3661/1221 

GAT AAA CTT GAC CAG TCT GAG ATT AAG AGC 
asp lys leu asp gin ser glu ile lys ser 

3721/1241 

AGC ATG TCT GAT GAT GCT TTG TTT • ACA TAT 
ser met ser asp asp ala leu phe thr tyr 

3781/1261 

GAT TTT TTT ACA ATA TCT GAA GTC TGC CTG 
asp phe phe thr ile ser glu val cys leu 

3841/1281 

ATA GCC AGG AAC CAG GAG GGG TTG GGA CCC 
ile ala arg asn gin glu gly leu gly pro 



3091/1031 

TCA TTA ACA GAT GAG TTC TGC AGA AAC CAC 
ser leu thr asp glu phe cys arg asn his 

3151/1051 

GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC 
gly thr ala leu gin glu phe arg glu val 

3211/1071 

CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA 
leu leu ile lys his ser phe asp asp arg 

3271/1091 

GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG 
ala thr leu tyr leu pro leu phe gly leu 

3331/1111 

AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG 
arg asp val ser pro phe pro val asn ala 

3391/1131 

CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG 
leu pro ala val asn pro leu val thr pro 

3451/1151 

CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC 
his lys asp leu leu gly ala ile ser gly 

3511/1171 

AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG 
asn ile asn ser val arg asn ala asp ser 

3571/1191 

AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG 
asn ser leu pro glu arg asn ser glu lys 

3631/1211 

AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT 
ser thr leu gly asn ser val val arg cys 

3691/1231 

CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG 
leu leu met cys phe leu tyr ile leu lys 

3751/1251 

TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG 
trp asn lys ala ser thr ser glu leu met 

3811/1271 

CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC 
his gin phe gin tyr met gly lys arg tyr 

3871/1291 

ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG 
ile val his asp arg lys ser gin thr leu 
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3901/1301 

CCT GTT TCC CGT AAC AGA ACA GGA ATG ATG 
pro val ser arg asn arg thr gly met met 

3961/1321 

GAT AAC TCT CTC ACT TTT AAC CAC AGC TAT 
asp asn ser leu thr phe asn his ser tyr 

4021/1341 

TCA TTA CTT GAA GCC AAC ATT GCT ACT GAG 
ser leu leu glu ala asn ile ala thr glu 

4081/1361 

CTA TTT ACA TTG GCG TTT AAG AAC CAG CTC 
leu phe thr leu ala phe lys asn gin leu 

4141/1381 

AAA AAA GTT TTT GAT GTC TAC CTG TGT TTT 
lys lys val phe asp val tyr leu cys phe 

4201/1401 

AAA AAT GTC TTC ACT GCC TTA AGG TCC TTA 
lys asn val phe thr ala leu arg ser leu 

4261/1421 

GGG AGA GCG GAC ATG TGT GCG GCT CTG TGT 
gly arg ala asp met cys ala ala leu cys 

4321/1441 

CTG AGC TCC ATC AGG ACG GAG. GCC TCC CAG 
leu ser ser ile arg thr glu ala ser gin 

4381/1461 

GAT TAC ACT GGA AAG AAG TCC TTT GTC CGG 
asp tyr thr gly lys lys ser phe val arg 

4441/1481 

CAG CTG ATA GCA GAC GTT GTT GGC ATT GGG 
gin leu ile ala asp val val gly ile gly 

4501/1501 

ATC AAC AAC TGT GCC AAC AGT GAC CGG CTT 
ile asn asn cys ala asn ser asp arg leu 

4561/1521 

AAG GAC TTA ACC AAA AGG ATA CGC ACG GTG 
lys asp leu thr lys arg ile arg thr val 

4621/1541 

GAG AAC GAC CCA GAG ATG CTG GTG GAC CTC 
glu asn asp pro glu met leu val asp leu 

4681/1561 

ACG CCC GAG CTC AGG AAG ACG TGG CTC GAC 
thr pro glu leu arg lys thr trp leu asp 



3931/1311 

CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG 
his ala arg leu gin gin leu gly ser leu 

3991/1331 

GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG 
gly his ser asp ala asp val leu his gin 

4051/1351 

GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT 
val cys leu thr ala leu asp thr leu ser 

4111/1371 

CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG 
leu ala asp his gly his asn pro leu met 

4171/1391 

CTT CAA AAA CAT CAG TCT GAA ACG GCT TTA 
leu gin lys his gin ser glu thr ala leu 

4231/1411 

ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA 
ile tyr lys phe pro ser thr phe tyr glu 

4291/1431 

TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG 
tyr glu ile leu lys cys cys asn ser lys 

4351/1451 

CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT 
leu leu tyr phe leu met arg asn asn phe 

4411/1471 

ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC 
thr his leu gin val ile ile ser val ser 

4471/1491 

GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC 
glu thr arg phe gin gin ser leu ser ile 

4531/1511 

ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG 
ile lys his thr ser phe ser ser asp val 

4591/1531 

CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT 
leu met ala thr ala gin met lys glu his 

4651/1551 

CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC 
gin tyr ser leu ala lys ser tyr ala ser 

4711/1571 

AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC 
ser met ala arg ile his val lys asn gly 
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4741/1581 

GAT CTC TCA GAG GCA GCA ATG TGC TAT GTC 

asp leu ser glu ala ala met cys tyr val 

4801/1601 

ACA CGG AAA GGC GTG TTT AGA CAA GGA TGC 

thr arg lys gly val phe arg gin gly cys 

4861/1621 

GAC GAG GAG GCC TCC ATG ATG GAA GAC GTG 

asp glu glu ala ser met met glu asp val 

4921/1641 

GTG CTG ATG GAG CTC CTT GAG CAG TGC GCA 

val leu met glu leu leu glu gin cys ala 

4981/1661 

CTC ATC GCC GAC ATC TAC AAA CTT ATC ATC 

leu ile ala asp ile tyr lys leu ile ile 

5041/1681 

GAA GAT GAA GAT GGA AAG GAG TAT ATT TAC 

glu asp glu asp gly lys glu tyr ile tyr 

5101/1701 

ATT TCT CAG AGA CTC CTT AAA CTG TAC TCG 

ile ser gin arg leu leu lys leu tyr ser 

5161/1721 

ATA CAG GAT TCT GGC AAG GTC AAC CCT AAG 

ile gin asp ser gly lys val asn pro lys 

5221/1741 

GTG ACT CAC GTC ATC CCC TTC TTT GAC GAA 

val thr his val ile pro phe phe asp glu 

5281/1761 

GAG AGA TCC CAC AAC ATC CGC CGC TTC ATG 

glu arg ser his asn ile arg arg phe met 

5341/1781 

AGG CAG GGC GGG GTG GAA GAG CAG TGC AAA 

arg gin gly gly val glu glu gin cys lys 

5401/1801 

TTC CCT TAT GTG AAG AAG CGC ATC CCT GTC 

phe pro tyr val lys lys arg ile pro val 

5461/1821 

ATC GAG GTG GCC ATT GAC GAG ATG AGT AAG 

ile glu val ala ile asp glu met ser lys 

5521/1841 

TCG GCC GAG GTG GAC ATG ATC AAA CTG CAG 

ser ala glu val asp met ile lys leu gin 



4771/1591 

CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC 

his val thr ala leu val ala glu tyr leu 

4831/1611 

ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC 

thr ala phe arg val ile thr pro asn ile 

4891/1631 

GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT 

gly met gin asp val his phe asn glu asp 

4951/1651 

GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG 

asp gly leu trp lys ala glu arg tyr glu 

5011/1671 

CCC ATT TAT GAG AAG CGG AGG GAT TTC TTT 

pro ile tyr glu lys arg arg asp phe phe 

5071/1691 

AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA 

lys glu pro lys leu thr pro leu ser glu 

5131/1711 

GAT AAA TTT GGT TCT GAA AAT GTC AAA ATG 

asp lys phe gly ser glu asn val lys met 

5191/1731 

GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG 

asp leu asp ser lys tyr ala tyr ile gin 

5251/1751 

AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT 

lys glu leu gin glu arg lys thr glu phe 

5311/1771 

TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG 

phe glu met pro phe thr gin thr gly lys 

5371/1791 

CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC 

arg arg thr ile leu thr ala ile his cys 

5431/1811 

ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC 

met tyr gin his his thr asp leu asn pro 

5491/1831 

AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC 

lys val ala glu leu arg gin leu cys ser 

5551/1851 

CTC AAA CTC CAG GGC AGC GTG AGT GTT CAG 

leu lys leu gin gly ser val ser val gin 
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5581/1861 

GTC AAT GCT GGC CCA CTA GCA TAT GCG CGA 
val asn ala gly pro leu ala tyr ala arg 

5641/1881 

TAT CCT GAC AAT AAA GTG AAG CTG CTT AAG 
tyr pro asp asn lys val lys leu leu lys 

5701/1901 

GGT CAA GCC TTA GCG GTA AAC GAA CGT CTG 
gly gin ala leu ala val asn glu arg leu 

5761/1921 

GAA ATG AAA GCC AAC TAC AGG GAA ATG GCG 
glu met lys ala asn tyr arg glu met ala 

5821/1941 

ATC TGC CCC CTG GAG GAG AAG ACG AGC GTC 
ile cys pro leu glu glu lys thr ser val 

5881/1961 

ATC AGT GGG ACT CCA ACA AGC ACA ATG GTT 
ile ser gly thr pro thr ser thr met val 

5941/1981 

TGA TTA CAT CTC ATG GCC CGT GTG TGG GGA 



6001 

TTT CCA AAG CCA ATC ACT GGG GAG ACC GAG 
6061 

AAG GAA ATA AAG AAC AAC GTT ATT— TCT TAA 
6121 

CAC ATA TTT TTT TAA ATC TCA CTG GCA ATA 
6181 

TGT GGT AGA CAC. TCT TGA GCT GGA CTT AGA 
6241 

ATA GAT GGC CTA CAG AAA AAA AAG GTT CTG 
6301 

CAT TGA TGC CTG GGG GAC CTT TTG CCT CGA 
6361 

TAC AGA ACT TAC TAG TTT TGT CTA GGA GTA 
6421 

TCA TTC AAC AAC ATA GAG CAA GAA TAG TGA 
6481 

CTA CTG GCT TCA AGT CAG AAC TTT GTC ATT 
6541 

ATT ACA TTT CTA CAT TTT TAA TAC TCA CAT 
6601 

ATT TGT GCT GGT CCA GTA TAT GCA ATA CAC 
6661 

GCA ATA TGG AGA TGT ATA CAA GTC TTT ACT 



5611/1871 

GCT TTC TTA GAT GAT ACA AAC ACA AAG CGA 
ala phe leu asp asp thr asn thr lys arg 

5671/1891 

GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC 
glu val phe arg gin phe val glu ala cys 

5731/1911 

ATT AAA GAA GAC CAG CTC GAG TAT CAG GAA 
ile lys glu asp gin leu glu tyr gin glu 

5791/1931 

AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG 
lys glu leu ser glu ile met his glu gin 

5851/1951 

TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC 
leu pro asn ser leu his ile phe asn ala 

5911/1971 

CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG 
his gly met thr ser ser ser ser val val 

5971 

CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC 



6031 

CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA 
6091 

CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG 
6151 

TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG 
6211 

TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA 
6271 

GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA 
6331 

CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG 
6391 

TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC 
6451 

GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG 
6511 

AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT 
6571 

GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA 
6631 

TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT 
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A. Allelic variations: single nucleotide changes (polymorphism) between CLASP-2 cDNA 
i so forms 



Isoform 


Difference 


Nucleotide(s) 


Consequence 


1 


polymorphism 


862 


A to G change; mis-sense 








mutation 


2 


polymorphism 




A to C change; mis-sense 








mutation changing codon from 








histidine to proline 


3 


polymorphism 


2210 


A to G change; mis-sense 








mutation changing codon from 








aspargine to glutamic acid 


4 


polymorphism 


2225 


C to T change; mis-sense 








mutation changing codon from 








histidine to tyrosine 


B. Alternative splices 


Isoform 


Difference 


Nucleotide(s) 


Consequence 


1 


exon deletion 


209-291 


premature, in-frame stop codon 








leading to the production of a 








truncated, most likely soluble 








protein 



These differences may be found separately or together in various combinations in the different 
human CLASP-2 isoforms 
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1st exon (nucleotides 335 to 445) 

TGTCTTGCTTATCTTTTCGCCCTCCAGGCAAAG CCAAAGCTAATTGAGCCACT 
CGACTATGAAAATGTCATCGTCCAGAAGAAGACTCAGATCCTGAACGACTGT 
TTACGGGAGATGCTGCTCTTCCCTTACGATGACTTTCAG GTAAGTAACGTTAT 
GTTTCTATCCGTAGAACCACG 

2nd exon (nucleotides 7101-7190) 

TTACCCAAGGCTTTTCCTCCTGTTTTTGTTTCCAG ACGGCCATCCTGAGACGA 
C AGGGTCG AT AC AT ATGCTC A A C A GTGCCTGCG AAGGCGG AAG AGGAAGC A 
CAGAGCTTG TTTGTTACAGAGGTAAGGCTCTTTCCTGCATTAATTTACATTTT 
GAAGTCATTTTCCCCTAACTGCCTCC 

3rd exon (nucleotides 1 1439 to 1 1521) 

TTTTCTA 1 1 IT 1 AAAATCCCCCTTCAATAG TGCATCAAAACCTATAACTCTGAC 
TGGCATCTTGTGAACTATAAATATGAAGATTACTCAGGAGAGTTTCGACAGC 
TTCCG AA GTG AGT AAGCTAT ATT AT A CAC. A T A GGG A A A AGTCTTT 

4th exon (nucleotides 13987 to 14056) 

CTAAAACAAATTTTCTTTGTTGTTTTTATAG CAAAGTGGTCAAGTTGGATAAA 
CTTCCAGTTCATGTCTATGAAGTTGACGAGGAGGTCGACAAAGATGAG GTGG 
GATACCTGCTTGCTGTTGCTTCTCTTTTCACTCTAGATTTAA 

5th exon (nucleotides 15212 to 15307) 

GGAGGTTGACTGCTGGTGTTTTCCTTCTCTCCTAG GATGCTGCCTCCCTTGGTT 
CCCAGAAGGGTGGGATCACCAAGCATGGCTGGCTGTACAAAGGCAACATGA 
AC AGTGCC ATC AGCGTG AC C ATG A GG GTG A GG A C.GC AC. A TC. A CTTTGCCCTC 
CCCTCTCACAAGCCCTTTC 

6th exon (nucleotides 16269 to 16404) 

TGAAAGAATAGCTGTGTGTATATTTTTCTCTCA GTCATTTAAGAGACGATTTT 
TCCACCTGATTCAACTTGGCGATGGATCCTATAATTTGAATTTTTATAAAGAT 
G AAAAG ATCTCC AAAG AACC A A AAGG ATC AATATTTCTGGATTCCTGTATGG 
GTGTCGTTC AG GT AAAT ATG A A A AG A GTTTT APfA TT A TGTTTTGTT A 

7th exon (nucleotides 19459 to 19633) 

AAGTATGTCTGTTTATCCTTTTTTCATTTCAG AACAACAAAGTCAGGCGTTTT 
GCTTTTGAGCTCAAGATGCAGGACAAAAGTAGTTATCTCTTGGCAGCAGACA 
GTGAAGTGGAAATGGAAGAATGGATCACAATTCTAAATAAGATCCTCCAGCT 
CAACTTTGAAGCTGCAATGCAAGAAAAGCGAAATGGCGACTCTCACGAAG GT 

agataggcttggcttcccccaggcacatacacactct 

8th exon (nucleotides 20567 to 20634) 
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ATTACAAGTGATTCCGATAATCTGTTTTGCCATTTTAG ATGATGAACAAAGCA 
AATTGGAAGGTTCTGGTTCCGGTTTAGATAGCTACCTGCCGGAACTTGCCAAG 
GTAACATCGTCTTATATCTTCTGCTCTTCGTTGAATGC 

9th exon (nucleotides 30257 to 30331) 

GATTGTGTTAAATGTAATTTTCATGTATCTTGTTATCAG AGTGCAAGAGAAGC 
AGAAATCAAACTAAAAAGTGAAAGCAGAGTCAAACTTTTTTATTTGGACCCA 
GATGCCCAG GTAAGAACTATCTAAATGTTTAATATTTAAAACCAAAT 

1 0th exon (nucleotides 3 1 85 1 to 3 1 99 1 ) 

CATAACTTATTTATATGTTTACATTTTCTTTTAAAG AAGCTTGACTTCTCATCA 
GCTGAGCCAGAAGTGAAGTCATTTGAAGAGAAGTTTGGAAAAAGGATCCTTG 
TCAAGTGCAATGATTTATCTTTCAATTTGCAATGCTGTGTTGCCGAAAATGAA 
GAAGGACCCACTACAAAT GTAATTTTTC ATTTTA A A A ATA A AC ATT A A A A A A 
AAAATAGGCAG 

1 1th exon (nucleotides 32472 to 32675) 

CCATGGTGATCATTGGATTGTTTTGTTTTGTTCA GGTTGAACCTTTCTTTGTTA 
CTCTATCCCTGTTTGACATAAAATACAACCGGAAGATTTCTGCCGATTTCCAC 
GTAGACCTGAACCATTTCTCAGTGAGGCAAATGCTCGCCACCACGTCCCCGG 
CGCTGATGAATGGCAGTGGGCAGAGCCCATCTGTCCTCAAGGGCATCCTTCA 
TGAAGCCGCCATGCAGTATCCGAAGCAG GTGGGGAGTATGAGCCCAGCATTC 
CCACTACTCAGACTCACTTTGCATGC 

12th exon (nucleotides 33063 to 33185) 

GAATTCTGCTTACTGAAGAAAATTGTTTGCCTCCTAG GGAATATTTTCAGTCA 
CTTGTCCTCATCCAGATATATTTCTTGTGGCCAGAATTGAAAAAGTCCTTCAG 
GGGAGCATCACACATTGCGCTGAGCCATATATGAAAAGTTCAGACTCTTCTA 
AGGTATGAATGGCTTTTACGCTTTGGGGTGGTAAAAAGCAATCTGAA 

13th exon (nucleotides 36702 to 36784) 

CAGTATCTCATAGCTTTATTCTCATGTCTTCAAG GTGGCCCAGAAGGTGCTGA 
AGAATGCCAAGCAGGCATGCCAAAGACTAGGACAGTATAGAATGCCATTTGC - 
TTGGGCAGCAAG GTAAGGAACACCTTTTATACCTTTTAAATCGATATAGATA 
GGTGCATGG 

14th partial exon (nucleotides 37353 to 37475) 

GAAACCCAGTTTAGAAATGTTGCTTTGCCATTTCAG GACATTGTTTAAGGATG 
CATCTGGAAATCTTGACAAAAATGCCAGATTTTCTGCCATCTACAGGCAAGA 
CAGCAATAAGCTATCCAATGATGACATGCTCAAGTTACTTGCAGACTTTCGG 
AA 
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1 TACCAAGGGCAACTCTGGCACACCCTAAAGTCTGGAAAGGGGACATT^ 
9 3 TG<^GAGCTTCCTTTGTGCTTAAATAACTGAACAAGCAT^ 

18 5 CTTATTTCTGTTTCTTTGTGATACCAAAACTAGCATACTCTA^ 

277 AAGAAATGGCATTTGTCTATGAGAATGTCTTGCTTATCTT^ 

3 69 TCATCGTCCAGAAGAAGACTCAGATCCTGAACGACTGTTTACGGGAGATGCTGCTCTTCCCTTACG 

4 61 TTTCTATCCGTAGAACCACGTGTTTGATCTTAACAAGCAGTATTTTTCTATCT 
55 3 ATTTTTTTTGAGACACAGTCTTGCTCTGTCACGCAGGCTC 

64 5 AGCATTTCTCCTGCCTCAGCCTCTCAAGTAACTGGGATTACAGGCGTGTGCCACC^^ 

737 GTTTTGCCACGGTGGCCAGGGTCGTCCCAAACTCCTGGCCTTAAGTGATCTATCTGCCTTG^ 

82 9 GCCACTGTGCCCGGCCTAATTATGGTTTTTAAAAGATGAAAATA^ 

921 TTCATGAGTTGATAATTTTTAATGGTATTATTTTTGCATATTAAITATAACT 

1013 GTGTATCTTGAATAAAAGTGCTATACTCTGTCTGGGCTGATTTAGTGGGCACAACT 

1105 TTTTTGGCCAAGTACCTTAGAATCTTAATGATGGGCTTTGCTG 

1197 AGCCTGCCTTCAGAGGCCCTGCACTCTGTGTGGGCTCCCAGCAAAGCGTTCAAGGn^ 

12 8 9 AATGTGGGTGCTGTTCCTTGGAA.GARATGTGGAGGACTCAGCACAGCTCCTGCTCT 

1381 ACTAACAAGAAGCTGTGAGGCTGTTGAGGAATGAGAATGACAT^ 

1473 TTGGTCCATTTGGAGGTTCTGGTTACTTTCCTTGCTCTCC^ 

15 65 ACACTTCTCTACCTGATGCTGTTACCATTTAAGCCCTGTCTTTTGTGAATCGAGCTGC 

1657 TTGATTCCAAAAGTGTTACATCCATGTACAAAAAGATAAATGAGAGGGAAATATTGAAATAAT^ 

174 9 ATCCCATCCCACAGAACATGACTTAACTGTATACAGCTCTGT 

18 41 ATCCTTGAAGGTTTTTTCCCACCAAATTTAAGCGACTC 

1933 AGAGATACCTGGGCCCCAAAATGATTATTCTTTAGCTCT^ 

2025 TGCTACCAAAGTAACAGGTAAATGGGTTGAGGTCATGCCAAACA^ 

2117 GACTAA.TAAATGAGAACCTCTGAATGATGCCTTTTGCCTGTGATTTGGCAACAAATGAAAA 

2 209 TGTTGACAAGATTTTCCTGTAGTGTTCTCTGAGGAAGCTAAAGG^ 
2 301 ATTAAAGTAACAACTTTTTTGAGTTTGCAACCTAGAATGAA 

2 393 ACGAGATACAGTTAGTTGAGTGTCATCTTTATCCCAGGGAACAG^ 

2 4 85 GAAAACTTCTGTGTCCCCTTTGCTTTTAACTACTCTGGTGA^ 

2 577 GGCTAAGGTGATGAGCAAATATTATTTTCTGTTTGGGGA.GAAG1TTTCCT 

2 669 GGAATTGTGATGGGC CC CAGTGAAGTTTGGGTACAATTATTTGTTTTCTTATAGACTCCCACTTTCTT 

2761 GCTTTCTTGTTAAATTGTCTCAGTGATGTTATTAACTCT 

2853 TGTGTTTTCTTAGGCAGAATAAGAGCAGAATT^ 

2 945 GAAAAGTAATACCAAGTTGGTTAGGAAATGGCAGCAGCAAAACGCATGCTGAGGGGTGATT^ 

3037 TAACTAITAAAATAATAGAACTTGCTCTCCATTTCT 

3129 CTTGTGTACATTTTTCAAGTAGGCACATGTTGATCTTGAG^ 

3221 TGTCAAGAAAAGATTTGCGGGTTGCATtnAGTTGCCTGAGGATGACA 

3313 AAACGTGGATTTTTAAAAATCAAAAGAATAGGCCAGGCGC^ 

3405 ACAAGGTCAGGAGTTTGAGACCAGCCTGGCCAATATGGTGAAACCCCATCTCTACCAGA 

3497 CCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCTCT^ 

3589 TTCCAGCCTGGGCGACAGAGTGAGACTCCATCTCAAAACAAACAAAAAATCAAAATA^ 

3681' CTGGGCACGGTGGCTCACACCTGTAATCCCAGCACATTGGGAGTCCGAGGC^ 

3773 CAACATGGCAAAACCCCGTCTCTACTAAAAATACAAAAATT^ 

38 65 GCAAGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCAAGATCGCGCCA 

3957 ATCTCAAAAAAAAAAAGGGAATATTAATGAAGTAAAGTACATGTGATCTGCCATGGC 

404 9 CTAGCTGATGGCCCTTCTTTTTGCAGAATCAGATCCTGTCGCTTGG 

4141 AGCATCTTCTCTTCTAGATCTTTCCTACCCTTTAGAGACCACTTGAA^ 

4 325 GGCTTCTTGCA^GCAGGCATCCCGCCCCGTGCTGTGCTrCCTGAG 

4 417 CATGCACAAAATCCCTTTCTTGCTAGGTGCTAGGGTTGAATACCCATTGCTTACCTTACT 

4 509 TTCTTTGATTCATCAAGTAAATATTAATACTGTTTGGAACATGTGAT 

FIG. 12B 
1 of 10 



4 601 TTGCITTGGCAGAACTAGGTAATTTCTAAAATTA^ 

4 693 CTTTATTTTTCTTATTGTTTTGTTGTCOTAC^ 

4 78 5 TTTAACAAGCAGTTAGACAGAGGTAATGACCTTTAGATTAAAAGGTTTTAGCT 

4 877 GCCTCTGTTTAACTGAAAGTCAAGCCCAGGACGCCTGCCTT^ 

4 969 ACGAJTCTTCTTCAAGTGGAGCTCTTTTTACCAGATG^ 

50 61 GGCTTTTAAGCTCCTTGTATCCTGACTTGTTACACAGCTTACTTATGCTTGTGCGAC 

515 3 TAGTAAGAATGTTGGGAGACAATTTAAGCTACCATTCA^ 

52 4 5 TTGTATGAAAGGATAAAAGAGCGTTAAGGGTTACA^ 

5337 GGCATGGTGGCTCACGCCTGTAATCTCAGCACTTTGGGAGGTC 

5 4 2 9 ATGGTGAAACCCCATCTCTATTTAAAGAATAAGGCTGGGTGTGGTGGCTCACACCT 
5521 TTGCTTGAGTTCAGGAGTTTGAGACCAGCCTGGGCAACA^ 

5 613 CCCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAAGCATGA 

5705 TGCACTCCAGCCTGGGTGACAGAGAGAGACTCCGTCTCAAAAAAATTA 

5797 CTTTGGGAGGCCGAGGTGGGCAGATCACGAGGTCAGGAGATTGAG^ 

5 8 8 9 AAAATTAGTTGGGCATGGTGGCAGGCGCCTGTAGTCCCAGCTGCTCG 

5981 GCAGTCAGCTGAGATTGCGCCACTGCACTCCACA^ 

6073 ATTAAAAAGAAAAAGAAAAGGAAACCTTAAGCCTAGTTATTGAGGTAGAC^ 

6165 TTAAGCCTAATGAACACGAGCAGTTCTARTGTCCGTTGGAGGGGAGGTAGCAT^ 

6257 ATCTTCTGTGTGTCTAGTTATCTATGCTCTTAGGCGCTGGGCAT^ 

634 9 GATGGTTGCACAATTGTGTGAATGTACTTAATGCCACTGAACTGTATACTTAAAAA 

64 41 AATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTG^ 

6625 GGCGGAGGTTGCAGTGAGCCGAGATCCCGCCATTGCACTCCAGCCT 

6717 TAAATTTATGCATATTTTACCAGAATAAAAAAAGGCAGTTAAGACAAGTAAGATGCTCT 

68 09 , GTGTTAGGGACTTTCAACGGGGCCTCTACCCTGCGGGAAGQ 

6901 TAACCCAGCCTGGCTTTGGAACTTGAGGCTGCCC7VGTGGAATCTGTTGTGTGTGACGG 

6993 GGGACTCTGTGGCTGCCAGGGCCAGCTGCAGGGa^C^CAGCTGCACTCTGAQ 

7085 CTGTTTTTGTTTCCAGACGGCCATCCTGAGACGACAGGGTCGATACATATGC 

7177 TGTTTGTTACAGAGGTAAGGCTCTTTCCTGCATTAATTTACATTT^ 

72 69 TGTCAAGGAAGTGTCAAAAGGCTAATTGTATTTCTATGATGGA 

7 3 61 AATGTCCACAGGGGCTATTTCTTTTTTACATTTTTA 

7 4 53 TTCOCAGAATTTATCTCATGGACTTAAAATAAGCAGTAACTTGTAAATG^ 

75 4 5 TTTTAAGGGTGACACACACATTTATGTATCAT^ 

7 637 GCTGATAATAAGCAGGGTCTATCGCTM 

7729 ATTATTTTTGTTTGAAAATGCAAATAAAATTATCTTATGGAAGAAAGATAAAT^ 

7913 TCAGCCTCCCAAGTAGCTGGGATTACAGGCGTGCACCACCAC^ 

8097 CAGAAAGAGAAATTATTACAATTTAGGTTGTTTGCTTTAGTTTT^ 

8189 GAATAATTAGATGTTTTAATTTTGTTTCTTTAAGT 

8 281 TACATCATAAATATGCTCATCAATAAA^ 

8 373 CCATTGAGATTTTTGAAAAATTAAATATAAAATTAAAAAATTTT^ 

84 65 CCTTTGCCACTATATTAGTAAACTTAATTCCAGACAAAX^ 

8557 TGGCAAACTATGACCTGCAAGCAAAATCCAGCCTGTAGCCAATTTTTGTA 

8 649 TATTGCCTATGGCTACTGTCACCATGCAACTCAAAGTTAAGTAGAGATAATATGACCCTCA 

8741 CACAAAAAGTTTGTTGACCTATGTTTTAAAGCATGTGGCAAAATTATTAA 

8833 TTTGAGGGAGAACTCTCCATTAAGTTATTTAATCACTGCAGGTTGAGCA 

8925 ACTTTTGGGACCCTTCCACTTGGGTGGAGTGAACATCACTTCCTCTTCATCC^ 

9017 AACAAAGCAGAAAAAAACTTCAAAAACCrTTCAGTGTAACTTCAA^ 
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92 93 TG^TCTCTAAATACTTAGCCAGGTTCACTCT 

938 5 TGAACATAATAAACTGTAATTATTTTCCCCA^ 

94 77 TTTGGGAGGCCGAGGCGGGTGGATC^CGAGGTCAGGAGTTCGAGAC 

95 69 AAGTAGCCAGGCGTCGTGGTGTTCGCCTGTAATCCCAGCTACTCGGGAGGCT^ 
9661 AGTGAGCCAAGATTGCACCACTGCACTCCAGCCTGGGTAACAGAGCAAGATC 

975 3 CTCCATTCATAAATGCATTTATAATATACAAATTTATGCTGGTTTATTTAGTTTTTA 

98 4 5 TAATGTCCTCX^GGAGACTTGGTGGTACAATTTCAGTTCTAt^ 

9937 ACAAGACCACGTCGTACATAGTCAGGCCTGACTACTTAGCTGTG^ 

1002 9 TCTGAAATCTGACTGGTTTTGGAATGAAATTTTGAGTGTGAATGT^^ 

10121 TTTCAAGGATAAAATAGCATTTTAAAATATTCTTAACTACAGAGTAAAAAAATCAAA 

10213 GCCCTTCTCTGCCATTTATTCCACCTCTCAGAATAATTTTAAAAACTTACATTACCTTCACA^ 

10305 TTATTGTAGAGCCTGGCACAAAATAAGCACTTGCTGCTGTTGTAATTCTTACTTTC^ 

10397 ACCTTCAGGCATGGCCTGTGACACAGATAGAAAGGTTGACAGT^ 

104 8 9 TACAGCCAGAGCCAAATAAACTAGATAAGGCATAAATACATTC^ 

105 81 TGAAGTCTGCACATAAATAGTCAAGTGCTCACTGGACAGA 

10 673 GACTCATTCATATTTGAAGATCTGGTGGATGTGGCATTATAATCTTAGGCTTTATGCA 

107 65 GACACCAGGCTGGCCCATTGAATCTGTCATGTGGCGAGGTAGTTTGAATGCA 

10857 GAAGACTACATTAAACATTCTGTACACACACACAGATGAAAATGTCATGGTGCTAATATTAATTG^ 

1094 9 TCACTCTTTTCCTTATGAATGGAAAGAAAAACTGAGGGCAGATTTATATGGTAATATTA 

110 41 ATACCATGAAGGCGTCTTTAAGTTCACGCATCCTATTCACTTTTATGT 

1122 5 AACTGATACTACCCCCAAGGGTAATGTTTCAGTGGTTCAA 

11317 TTGATGGGAAAAAATTTTTTTGATGTTTCTATGGGTATGCCTTCACA^ 

114 0 9 TTTTCTATTTTTAAAATCCCCCTTCAATACT 

11501 AGAGTTTCGACAGCTTCCGAAGTGAGTAAGCTATATTATACA^ 

115 93 TCATATGCAATCAGAGTAATTGAGGAAAATATTTTTAGATGGTTTATGTGTATG 
11685 CTCAAACATGCTACTTTGGTATTGATAGGTAGTAATAAACATGTGGATGZ7ITTAATTA 
11777 ATAAATAGGTGAAATTTTTTCACTACAAAATATAAAACAA^ 

11869 AATATTTTGAAGAAGATAATTATAAAGAGCATTTTACATTGAATAAATTT^ 

11961 AGTTAAATTTTTTTTTTATCTCCTTTAATGTATTAACTTATTCTAGA 

1205 3 GTGAATCGCGGTAGCTAGGTATTGCCTTGACAGACTATCTTTTATAAAAGGTTCCATTGTCT 

1214 5 AATTTCTAAAGAAAAGAGATTTATTTATTTATTTTTATTTAT^ 

12237 CACGATCTCTGCTCACTGCAACCTCCZACCTCCCAAGTTCAAGCAACTCTCCCTC 

12329 ACCACGCCCGGCTAATTTTGTATTTTTAGTAGAGACAGGGTT^ 

12421 CTGCCTCTGCCTCCCAAAGTGCTGGGATTACAAGCATTAGCCACCGAGACTAGCCAAGAAAAGA 

12513 TGTACAAGAAGCATAGGACTGGCTTCTGCTCAGCTTCTGTTGAAGACTTTTGTGCTG^ 

12605 TGGCACTTGTGAAAAGAGACCAAAGAGGAGGAQ3LAAACTCAC 

12697 TCCCATCTTGCCAGGAACAAGAATTCACTCATTACTGTGAGA^ 

12789 ATTATGCAACAACGGGGACCAAATTTCAATCTGAGTTTT 

12881 TTGATTTTGGGAAAATTGAAAGCAAATAGTGATGGATTATGTITTAAAACTAAC 

12973 TATT ITP CTTTGC G TI TG TTACATCTTTGAG^ 

13065 TTGCCACTCCTTAAGAATGTTCGGCCAATTCCCCGATTGCCTCTTTTTAAACCTCAGCCW 

1 3 1 S 7 ATGGGTAGCCCTTTAGTTCTATATTTACCCAATCCTCCCTTAGGGATTTTTTAATTCT^ 

1324 9 CCTCTTGTTCATAGTCTCAGGGTTGAAAGATATCAGACTATCT 

13341 GGCATTAATGAGAATTTCCAAATGTGTGATGAGAAAGAGAGGGA^ 

13433 GACTCTGCCGTCTGTCAGCCTGATTTTCCTCAir TGTTAAATGGC CTTAACAAGAGTCACCATGATTAAAGGATTAAATGAGAGGGCACATG 

13S25 AGAAGTGCGCAGGGCCTAGCACGTCATACCCAITGAGTAAATGTAAG^ 

13617 TCATTGTTAATCGGTTTCAACGCAACAACTCAATCTTCCTTTTTITCC 

13709 TTAAAAACTAATCCACTTTCTTATTTTTAGTATGCCTGCTAACTCCCCAGA 
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.3801 ACTCAAGTCTCTTGGCTTACCCACCTTGAAAAGCAAGGGCATA 

.3985 AGCAAAGTGGTCAAGTTGGATAAACTTCCAGTTT^TGTCTAT 

.4077 TGCTTCTCTTTTCACTCTAGATTTAAACATCAATTTTACAGACTT 

.4169 GAGTGTTACATGAAACGGGAATAATTTTTTAAAAATGTTATTGATTGATTGGAATA^ 

.4261 GTTCTCTTGGTTGTGTTTATTTGTTCTGAGATCATTTTCAAAGACTTGGATCAGATCTGGCTAC^ 

.4353 ACCTTGAAITTAGGTTGTTTTTCAACAGATCTCGAAACAGCTGCCAGCCAGT 

L 4 4 4 5 AGTCTGATTTGATCAAGCCCACTCCCCCTATTCCTAGAGGAAAGCTCAT^ 

L 4 5 3 7 TGCCCACTGAGTGAGGGAAACCTCACAAGAAGACAATGCCCATCTCTGCATTTCTCATCCTCC 

14 629 TTTAGGTTTTTCCTTCTTTAAAAAAATTGT^ 

14 7 21 CCCCCATCACTTTATCTCTCCTTTCTGTATTGCCTCAATTTCCTCCCTTGCTTTATTCACCTTTCCCTGAACT 

L 4 8 1 3 CAGGAATGTGCATGTGCTTTTCTCCTCTGACTATAGGGGACT 

L 4 9 0 5 GTGGTCAGTTGAQ3TATGTCCTTTTTGGTCCTTTTGTGGCTCATTAAACACTGACA 

14 997 TGGAATAAAGTTTTCAGAAATGTAGGCGGGTCTCTCTCTTTCACCGCTAAGTGGACTT^ 

15 089 CTTCAGTAAAGGGGCAATGGACAACTTGGCACAAAGGGAATGACCTTC^ 
L 5 1 8 1 GTTGACTGCTGGTGTTTTCCTTCTCTCCTAGGATGCTC^ 

L 5 2 7 3 GCAACATGAACAGTGCCATCAGCGTGACCATGAGGGTGAGGACGCAC 

153 65 CGAGAACAATGCTCAAGATGAATGCGCATGCTGTTCTTCCCCACAA 

15 4 57 CCCATTGGCA.CAGCCTCATCCACCCACTTTCCCT 

155 4 9 ACACTGACATAGGCTTTCTTCTCTGTAATAACTGAAAAGTGCTCTTTGGTACCTCAC^ 

15641 CTGAGCACTTCTGTGGAGGTGTACTGCAGCAAAGTCAAGTAAAGCAAAAATTGAGG^ 

15733 GCAGACAGAGAAGTCAAACCAATAGAAGAAGCTATTCAGGAGAAA^ 

15 8 2 5 AAGGGAGTCAGGGAGATAAAAArTAAGGAGGAAATGTGACTCT 

15 917 TTCCCATTCTGTGAGCAAGAAACCCTGGGGATTTTAGA^ 

16009 TTGGGTAGTTATGTAAAATCTCTGATTCCGTGGGTGAGAA 

16101 AGCTTCTTAGATGGGATGGGGAGTCTTGACCCCACAGTGACCTGGAGC^ 

16193 GTCATAAGTGGGTTATTGATAGAGATTGTGACCCTCTTCATTTTGAAAG 

162 8 5 TTTTTCCACCTGATTCAACTTGGCGATGGA^ 

16377 TCTGGATTCCTGTATGGGTGTCGTTCAGGTAAATATGAAAAGAGTTTTACCA 

164 69 GCAGATTTAAGCAAACACTTCCAAAAATGGCAATATG^ 

165 61 TGTAGAAGATGGAAGGGATAATGTAGAGGCAGA^TTATGCTGTGGCAGGCAGGAGCACTCT 
16653 AACAGGCTACTTAATTCAGATAATGAGAATGTTTCTTTAATACGGCAAATGACT 
16745 TTC^TAGTATCTCAGTGGTGATTTTTATCGCTAGCATTGTAGT 

1 6837 AGTTCAAATCCCTGCTCCTCCACTTACCAACTGTGTAACCTT 

16929 AGGATAATGGCAGTACCAAATATGGTTACTGAGAGGGTTCAT^ 

17021 TGGTCAGTATTAGATA G rriTGTTATCATAGGGCTGTTGTACTTTTA 

17113 AAGATAACACATGAATGTATTTTTCTTGTAAAAAATCAGCCAATACAGA^ 

17205 CTTTTCTGAGGGGAGCTACTACCAGTTTTGCATGC^ 

17297 ATCATCTGTCCATCCATCCATCCATCCATCCATCTGTC 

1738 9 TTATAAGCATACACAGGACATGGGATAGCATAAATTTAAAGTGGGGGTGAAAGCAGAA^ 

174 81 AATAGAGTTAAGGAGAAAGCGTATGTTTTGAAGATCTAACACCTGCT^ 
17 573 CTGTTTAGTCACGCAATTCACAGTGCACATGAGATAAAGGCAT 

17 665 TAAAGCAAAACTAAATGTATTTGGCAACCTCATTTTTTA 

17757 GCCCAGGGACAGATGTTTATAAGTACAACTGCCCTGAGCTATCAATTAGTCTCCGGGGTGCAT^ 
178 4 9 AAGTCTCTTCCITCAAAI^GTCrTTGCAGATGCAGCTGATGGTGTTTTCA 
17941 TGTTTTTATCTGTGTCTGTTTGTAAACTAAGCAT 

18033 TAGTTGACTGGTGCTGTGAATTAAAAAAAGTGCCTAAACTAAAAAATTTTGAAG 
1812 5 AGATGAGAAAACTGAGGCTCAGAAACAGAAATTTAGAATTGAGGCCTAA^ 

18 217 GTATGCAGGGAAAGGAGGAGAGAAAATTCATTTTGTTTCAAGCCTT^ 

18 309 GTGGCAATTACTGAGCCCACAGCAGACAGTCTGTGCACAAGAGTGTGTGGCTTTGCC 
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TGCTGCAGAATCACAGAGACTTTTGGGAGGGTTGCCC 
GATTTTACTGGGTATGCGGGGGGCTGTCCTG^ 

TTGTGCCATGATCCTCAGAGCTCAACCTCTGTCTACTGCGGCCAACCT 
CTGGCATCTATTGGCTTAAGGCCATAAATTTC 

TCAGTAGTACCGAGATTGAGAAATCCTGGCCTAGTGCATGTTCAXCT^ 
AGTCACCCTCTTGCAGGCATCAAAACTGGAGGCATGAGGCAACGC 

AAGTTC CAGATAAGT AAGC CAAACAT AGT AAT AGATAAT GGTT GGC TTTTAAAAATGT AAT AC CATACACTACTTCATTAAAAAAATAGGAG 
CTGAAGAAATATGAAAATTTTACATGAAATTTCA^ 

GAGACTAAGGATGTGTAGCACTGACAAAAATGGGAGCACTGAGGAGGTTTG^^ 
AAGCAGCTAATGAGTCAGAAGGAAAAATAAACGTTGCAGAAAGTGAAGCA 

ATGAGGAGGGGGCAAGCTATTTAAAATAGCTTGGTTCTAAATGCCAATGAGATATCACTCACCmC 

TTCTAATTCTTTTAAAGTATGTCTGTTTATCCTTTTTTCATTTCAGAACAA 

AAAGTAGTTATCTCTTGGCAGCAGACAGTGAAGTGGAAATGGAAGAATGGATCACAA 

TTGCCAGGTGGGTATAAGAAGGAGACCTGTGTTACACAAGTACATGAGAGGTCGGACGG 
GGTTTTGTCACATTATAACrrACTTCCCTGACATTTC 

CCTGGGTTTGAGTGCCATAAATACTGTATTTCAGCTGTGTAGCCCTGGCTAAACAACT^ 

AATGGGAAT7 1 AACATGAAAATTGCTGGGAAGATGGGAAGTGTAAATAAGAAAAT 

AAAATATTAGTAATAATTAGAATGCATGGGAGCCTCACATTAAATTGGTGAGAA 

AACCCTTAGGTGATTCCCAACCCTGGCTTCCCCCTAGAAGTACCT 

ATTTAGTTGGTCTGGGGTGGAGCCTGGGCAGGTCTGACTTTTAGGGGGTCTCATGG^ 

AGTTCTAATTGGACGGTGTCCATGCTATACCAGCTGCTCAGTGTTTTGACTTT 

CCAACTGTTTCCCAGGAGCATCCAGAGTGGGGAACCACTGTGTTCA^ 

GAAATGCTATGAATTACAAGTGATTCCGATAATCTGTTTTGCCATTTTAGATGATG 

TGTCCAGAATGGAATCTGTTGAAATCATAAAAATTACATTCT 
CAGTATGATTTCAOTrCTCTCTTCTAAAAACT 

AACACTGCATCACAGGTACTGAAGAAATAACTGAAATATGCAGGCAGATGTTCTCATAATAGC^ 
CTCCCTTACTCAGGGTAATAGACACC/7ITCCAAAGAGGAAGGACCTCCT 
GTTTCGGGTCACTCTTACTGGAAAAAAAATAAGCTATTCCTGTGTCTTACAATTTTGAGA 
GCAAATGAACCATACTGCTACTTCCCAGATTAAATATCTATTATGATGTTGCCTTTTTA^ 

ACTTTCATTTTTATATCTTTACAAATAAATCT 

TACCATATGTATTCTGCACTTTAAAAAATTTTTAAATTTACCCCTTTTA^ 
CATTTTTTTC' 



:tttcttaaatgagagaaa 
tatctttgtcttctctagccctttgcactcttactctgttacrgcccttctattc^ 

C CACATCTTAAACACTATAATAGAAACATGGTTT ATC TAT ATAGGATTAT AAAT AGACCAGCATTCAGCATTGAC CTTTATTTTAAGACAAC 

ATGGCTGTTCTCAAGTGTAAAATCTCCCTCCCTGGCTAGGGCTTTAGAGCAT^ 

TAGCACCTGCCTATTAAAGCTAATTTTAGTGCCACCATTGTAAACCACCTCCTACT 

ATGTCTGTGTATCATATTTTGCAITGAGATTTGCTTTTTTGTTrc 

ATTTGGGTTTTAGTTTGTTGCAGCAGGTAATATATGTCATGCCATCTAGA^ 

GATACCTAGGAGAAGATTCATGTTATTTGAGCCTAATGTTGATTAATTA 

CTCCTTCTCTTACTACCCAGCCAATATAATTCAGTATTGTTrGATCCCAAGACCTAGG 

ATGTATACATTTATGTATATATACTTTTTTATAAGT^ 

CAAGTTAGGTTAAGCCATTTTAGTTGGTGAAATCAGTTTGA^ 

TTATTGAGGTATAATTTGCAATAGCAGAATGCTCAAACATGAATTGTA^ 

CAGGCTAGCTTGTACCTGAGACCCTCTTTATTTTGACCTCCATCAC 

TACATGTACTCTTTGTGTGAGGCTTATTTAGCTAAACATGTGATTCACT^ 

AATCCCAGCACTTTGGGAGGCTGAGGTGAGCGGATCTTTTGAGGTTAG 

TACTAGAAATACAAAAATTAGCTAGGCGTGGTGSCAGGTGCCTGTAATCCCAGCm 
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2 3001 GAAGGCAGAGATTGCAGTGAGCTGAGATCATGCCACTGCACTCTAGCCTGGGTGACAG^ 

2 3093 CGTTTTAATGAAATAAAATGGAATGGAGAATATGAAAGTACACTGCCCTTAAT^ 

2 318 5 TTTGOTGGCACCTGCCACCATGCCTAGCTAATTTTTGTATTTCTAGTAGAGACAGG^ 

2 3277 CTAACCTCAAAAGATCCACCCACCTCAGCCTCCCAAAGTC^ 

2 3369 AGTGAATTACATAATTTTTTACATAAAATAATGTCAT^^ 

2 3 4 61 GATATAAAAGATATTTGTTTTTCTTAGTGTGCTATTATGTATATTTATTTACTTTCATTGGTATATAATCT 

2 355 3 GATACTTTGATATCTGTATACAATGTGTGATGATCAAATCAGGATAATTGGGATA 

2 3 64 5 TGAAACATTTCTTACCAGGAGTCATGGTCAAAACCTGAAAAA^ 

2 3737 TACAAATACTTTTATTTAGGAAGGTAGAAAGGTGGAAAGTAATTTTTTGAAGGGGAAAAGAAT 

2 382 9 GCACATGGGTATCTGTGGGCTTTGCCTTTTAAAGCCCAGCTTGGTGTCAGTGTGAG 

2 3921 CCCCAGCCTTGAGCAAAAATGCAGTTTTGGCTGTTTATCATCCCCCTTCAGGGTGTCTGAAC 

2 4 013 TGACCTGATAACTGTTGGTCATCCCATTAGGAAGGATGGATTCCATGGT^ 

2 4105 AGAGAACTGTGCAAGGAGTTTACCCAGGCTATGAAGAGCTAAAGAGGTC 

24197 CTGGGCCACAGTTCAGTAAGATTACAAAGAAGaXCTAGAACAATGAQ 

2 4 28 9 TTTTCTAATCAGCTCCTCTAACCTCCTTCATCCTGTTGCCCCGGCTTTTGTTTTCCACTGTGACTAAG^ 

2 4 381 ACAGGAAGTTGTTTCAGTGCAAAAATAACTGATGTCTCATTCTGGAAT^^ 

2 4 4 73 AGGTTTTCAGAATATTTTGTATAATCTTGGAGCTTATGTTTGTACATTTACT 

2 4 5 65 TATTTACCCTAAAACTGGTTCTTTTCCTCCTATTGTCTAT^^ 

2 4 657 T AT AAAT ATTTT AGTTT GAT ACACAAAC C GT AGTTCAAGTT AGTT AATTGATTT CTTC C CT AGAAAGTCAAGGAGT AACATAAT CAGGTT AT 

2 4 7 4 9 AAACTTCATTACTAGTTATTTAATAATTTATTTC 

2 4 8 41 AGTGTGAAGAAAACCCACCTTATGTTTTCTTCCACAGCTTTTCTGTIT 

2 4 933 CTTTGGTTTCCCCATGTGGTTCTGAAAGAGAAGTAGAATTTCTTC 

2 5 02 5 GCAAATGAACTTATTGTTCCAGGTAAATCTTCCACAGTTGCATGCAGGGGAAAGTATGATGT 

25117 TGAGGATCAAGGGCCATGCTCAGCAGAACTTGTCAGGACCCAGCAGTTTCACGG^ 

25 2 09 TGCTTTCTTCCCTAGATTCCAATCCAGAAAACAATATCATTGCACTATTATACAAAGGAGCTGGCTA 

2 5 301 TGTTGCATTTCTGGGCCTCCTTTGTGAAGAGGATGAACTGATGGTCCTGAGAAGTTAGGTGTCTTGGAAGTACT 

25 393 CTCTTTAAATTAAAGATTATATATTTTGGCCTCA^ 

25 4 85 ACACATTGTTTCTGATAATTCATCCTCAGAATAAGATGCTGTTGG^ 

25 577 AAGACACAGAGTATGCCTAAAGCCTACAGCAGACTTTCTGGAAACTCTTGAATGTTTGGTCCA 

25 669 CTTGTCAGGGTGTGTCATTAGTGCTTGAATGTAGGGTTTACAGGATGQ 

25761 TCCTTGTCTTCTCAGCAAACACCAGTTTCTACAGAGAACAGCTCTGCC^ 

2594 5 AGACTGACCACAGGTTATTTTAAGAGCAGAGCTGGTT^ 

2 6037 TGTATTTTCTCCACTCTTAGATTGTGAAAGTACATATTACAAGTATTTATTTTATTATCT^ 

2 6129 TGCCGCAATAAGTAAAAATACCCAAAGTTGTTTAAAGAAAAGTTCACCTTTTC 

26221 TTAATAACCGGTTGCAATTCCCCTTTCACCAAACTGATTGCTCA^ 

26313 AATATGTTCATGATATATACATTATTCTGAAATTTTCTGTATCACTTAAAA 

264 05 TCCCCCTTACATACATATCAGCAAGCTGCATGATATTTCAACTATTGCTACTG^ 

2 64 97 AGGTTTTTTTTTCAGTTTTTAGCCTCTACAAACACT 

2 65 8 9 CCCCAATGTGGAATTTTTAAGTCAAAGTTTATTGTGTTTTTAATG^ 

26681 ATTTCTGTTTCTCTGCATCTTCACCAGACGAGTGTAAAATGGTATTTTACTCT 

26773 . CATATTTTCATATAriTATTTGCCATTTGTGTTTTTTCTTCCTGA 

2 68 65 GTAATAGTTTAGACTCTGAAGCCAGGCAACCTGAGTTAGAAGCCAGGCCTCTATTTCATGATCT 

2 6957 ATGCCTTAGTGTTTTCTCTTTTAATGAGCAGGGATAATAATAGTACCTGCCTCCTAAGGTTGTATA 

2704 9 ATCTAGCAGGTAGATATTGGCTATTATCAATAGTAGCTCTTATC 

27141 CTACCCCTGAACCACATTTTTCTACCTCTTAGATTTTA^ 

27233 TAGACCTTAGGAGGGATGGGAGGAACTTTAAGAAGC C GAATTTGC TTTTT ATTTATCTTGT AGCAGAGCAATAGATGTAT ATT AGGTAGATT 

27325 ACAAGCTTTTAGGTTATTTTTGCATCTAAAGCTGTCCCTTCTTT^ 

27 417 GACAAAATCAGAATGCTTTGTGTCTATTTTGGCTAGTAGTO 

27509 CGAGCTCATTAGCAGTTATTCTTGCTTTATTCATTTCTTATCT 
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TTCACA^GTAGAATAGATAACTGTTGCT 

AAGCAAGTTCTTTGGGTTCAGTTAAATAGATTCCACTTG^ 

AGAATAAAATGAACTGGTCCCATCTCCC^TAT 

GCAGGAACGCCCCTTCACCTCCATAGCCTTCAGGCTCCCATCGAT^ 

GTTTAACATTTTTACCTGGAAGTGAAAATGGAATC 

TCTTAGTATTCTGGTCCCCATTITATATAAACATAGACAG^ 

TATGGCTAAGAAGATTAAGATGCTTATTTTCTCGTTATCGATG 

GACGCCTTTTCTGCTGTGGTTGTGAGTGCCCCAICAAGTGCT 

TATTCCGACTCCTVTAGACTTAACTGTGGAGACTGAGAGTAGTG^ 

GCATGAAAAATCATAAAGATGGAGATGACTTCTATCTTGTGAATGTCTGAAAAGCCTCTCTCTCTTT 

CCCTCATCTCTGCCACCATCTGTCGGTTTTCTTCCGTGGCCTC 

CTCACTTTCACCCTCTCCCTCAGGACATCTCATCCACTCCCAAGGC^ 

TTAGCTGTCTATTGGATGTTTCCAGTTGGAAATrCCACAGGCATC 

TTTGGCAGATTTGTTTTTCCTGAGTTTTCTGTCGTGTTCATGGA 

CCCTGCCCGAGTGATGCTTCTGAAATGTGGAACTCATTTTATTCTC^ 

AATCCTTTTTTTTTTTTTTTAAGTGAAAGCAAGTTTATTAAGA 

GCTGCTGGTTGCCCATTTTTATGTTTTTTTCTTGATGATATGCTAAACA^ 

CATTTCTTTTCACTTTTTTTTTTTAATCATGCCCTGCCTTT 

ACCTGCTGTGCCTTCTCCTAGAATCXIGCTTTTCTCC^ 

CTGACGACCACCCCCTAGTCCAAGTCAGCTCCCACTGTACTTTAAACTTTCTCriXrrCTTCCTTAT^ 

CTGGTGTTCCTTGGGACTAGGGACTTCCTTCATTCACATTTCACATAACTTGAGGGCCTGGCT 

AATTAATTAGCATCTTGTCCTTCAAGATCAGCCAT 

AGTCACAiGACTGGACTCTATTAAATCCTGTCTA^ 

GATGTTCAGCTCGATTCTGCCCCTTCATTCCAAGCCTGTCCCATAl^ 

TCTCTTTTGCTITCTTTGTGTTGACTTTGACTCTGCCACTGGTTGTGAGCTTC^ 

AGTGCTTTCTTTGTGTGCTGCACACTCCCTGGCACACACAGCGGCTC 

GGGCAGCTTTTAAATGTATTTGGGC^CCTTTGCAAG^ 

GCAGAAATCAAACTGAAAAGTGAAAGCAGAGTCAAACTTTTTTAIT^GGACC 

AAACCAAATGTGGGAGAGAAAATCATCGATGGGCTTATTTGTTTATTTGTTTGCT^ 

AAATTTGGGGAAAAGAGGAAAAATAAAAATGTATAATCTTATCACCATAGCATTACTATTGTGA 

ATTGGGAGAGATTTATTCAAAGACATTCTCAAGTT 

TGCAGTTGCCTGGGAGTCAGTGATAATTCCCGACTAGCC 

TTTCTGTTTCCTCACAAACCTCATAACCATACGTATGTGTGTTTATATGTCTATAITTTT^ 
GTAAAATATACATAATATAAAAXTTACCATTTTAAC 

TTACCACCATCCATTTCCAGAACTTCTTCATTTTCCCACACGGAAACTTTGTATCAAATGATA^ 

AGTAACCTCTGTTCTACTCTGTGAACCTGCCTATTTTAGGAACCT 

AAACTTAACATGTTTTCAAGGTCAATCCATGTTGTAGCATCT 

CTACATTTTATATATCCTTGTAAATCTGTTGATGGAGACTTG^^ 

AGCTCTGTATTTTTTCAGTTCATCCATTG^U^AGGTATACCAT 

TCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCAATCTCGG^ 

CCTCAGCCTCCCGAGTAGCTGGGACCACAGGTGCCCACTACCACACCTGGCTAATTTTTTTGTAT^ 

rTAGCCAGGATGGTCTCGATCTCCTGACCTGGTGAGCCGCCAGCCTCGGCCTCCCAAAGTGCTGGAATTACA 

GCCCATGTCTTTGACCATTGTTATAAACTATGTGTGTAACTACTATAAACCATAGAAACCGATTATATAA 

TAAGTGTATATAGCTTITCCATATTTTATTCCGTTTCC^ 

TTACATTTTCTTTTAAAGAAGCTTGACTTCTCATGAGCT 

GTGCAATGATTTATCTTTCAATTTGCAATGCTGTGTTGCCG^ 

ATTAAAAAAAAAATAGGCAGAGGTTTCAGATGTACCTTTA^ 

TTTGGATCAGGATATGGAGGTTGTTAGCCTGCAAGGACAGGATGTTC 
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3 2 2 01 GACAGAGTTCATCTTAGATGCTTCAlGGGAACGCAAC^ 

32 293 TTTTGTTTGTTTCCTAAAATGTTTTATAAGCTAACT 

32 38 5 TAGGTCTTGTCATTTCTCTTTTGATAGCAACATTC 

32 4 77 ACCTTTCTTTGTTACTCTATCCCTGTTTGACATAAAATACAACC 

3 2 5 69 GGCAAATGCTCGCCACCACGTCCCCGGCGCTGATGAATGGCAGTGGGCAGAGCCCATCTGTCCTCA^ 

32 661 CAGTATCCGAAGCAGGTGGGCAGTATGAGCCCAGCATTCCCACTAC 

32 75 3 TTAGACCTTGTAATGCACAAGTGGGGTCATTAGACTCTT^ 

32 8 4 5 TTTAATTTCCTTTCTTCTTTCTAAAAGTTCTCCTAGTTAC^ 

32 937 CTAGTTTGGCACTGGGTGCTTTTTACTAGTTGTCCTGTTTC^ 

33029 TTCTGCTTACTGAAGAAAATTGTTTGCCTCCTAGGGAATATTTT^ 

3 3121 AACTCCTTCAGGGGAGCATCACAC^TTGCGCTGAGCCATA 

33213 GGTAAAAAGCAATCTGAAAAGAGGCCTTTATGTGATACTATAAATCCTT^ 

3 3 3 05 GAATATGCTACCTGTA3XTACTCTGAACTTTATGTCTTGATTTTGAAGGAAATGTAGTCCTATGTCT 

3 33 97 CAGTTCTGACATCCTAAAATAATTTGCAAAGGAATTACCAGCTTAATACTAAACTTT 

3 3 4 8 9 AGTTTCTTCTATCTGTTAATTTGCCTCTTGGGTTCTGAAATTCTATTTTGGTCCACTTACACTTA 

3 35 81 CCTTGGCTCAGATGACCTGGCCAGCAGTGTCAGTGATTCAGGTCCACTTGGTTTTTGCTAGAAGGGG 

3 3 673 GAAAACATACATGATGTATGTTTGGTTTTTTTCAAAGTAGTG 

3 3765 TATTAGAGGTCATCTAGTCCAAGATGTGCTTTCATATTTCAGGACAC^ 

3 394 9 ACATATTTTTAGTTATTTAAAAAAAAAATTGTGTGGTCATCTGAGAGGCCTGAATTCAC 

3 4 04 1 CCCAGAGGCTGTGAGTGAAGTTTACTAAGTTGGATTCAGAGCT^ 

3 4 133 GATGCACGATGGCCATTGAGAAATGGATTTTGGAGTCAGAAGACCTGGG^ 

3 4 2 25 CACCTCTCATGGCCTCCATATGTTCCTTCTGTGCATGAAGGATGATGTTACTTCTTGCCTCTGCCTTCCTCATAGGGACAG 

3 4 317 AACAGATCATGTATGAGTCAGTGCTGTGGGCACCATAAATCACAGAAAGCCCAGAAGACATCGTG 

3 4 4 09 GCCCATTTACCCAGGCACTVrTGGTTCCAACAGTAAGCCTTTTTGGCTGAT 

3 4 501 TTTTTTTAAACCAAGTCTGTAAAACCTTGGATGAGAAGCTCTTTTAGC 

3 4 685 GGCAGCTCATATTGGTGTAAAAATCACATATCACTGTAGGCTAAACTTACCTCTGCACACTCCTCCATGTCC^ 

3 4 777 TGCTTTTTCTTCATTTTTTTATGGAATGTAAAGCTCATCCATGTG 

3 4 8 69 TAAAGCAGGAATTAAGGCTCAACTATCITACTTTAGCACA 

34 961 TTTCTTCATGTTTTCCAAGATGGTCTAGAAC^TCATTTAGAGTAA^ 

35053 CCTGTGAATAGAGGTTTTAAAAAGAAAAAGAAGGO^AAAAAAGCCCAAAGGG^ 

352 37 TGAAATTACTGCATTTTACTCCACAACATATTCATCATTGTTAGAACCAAAAAA 

3532 9 GCTAGGTAGGCCATTGTACCCTATAACTCATACACCTAAGACCCCACTAAGTCGCCCCACCCAGGGAGGCAAGAA 

35 4 21 GGGGGGAGGTCCTAGATGATTCCCTGACATCTCITCCAATTAGGATGTAITTTCCAGGTTACTCAT^ 
3 5513 TATGGTAGGATTATAAAGTTTTGCAACTGAAAGGGATCATGAAAAGTATAGCCT 

35 605 GGTTTGATGGCCTACAGAGTCATAC^GCTGGATTCAGTGTGACTTAGAGACAGACAGCTCT 

35 697 CCTTCCTGTGGCTGATGGTACACACAGATCACCGGAGGTCTTGTCAGAACGCACGTTCTGATCCAGGAAGTTTG 

3578 9 TGCATTTCTACAGTGATGCTGATGCTATAGCACACTTTGGTTTTGGAGTACATTTCCCAAAATTGCT 

358 81 GACCCTTCAGTAAAAAAAAAAAAAAAAAAATTAAGTAATGTAAAAGACCC^ 

35 97 3 AAGCACTATCACAGTACAGACAAATTTGGAAAAATTAAGCATTATCTTTTTTAGCAATAGGGGTATAGAT 

3 60 65 GGCTAAAAGCAAATAAGCTTCATTGTACACTGTGAATTAATCAA^ 

3 615 7 TAATTCATGTTTAGAAAGCAAGAATATAGCATTGA 

3 62 4 9 TCAGCATAAGCTAGTGCGCTTTAGGAGCTGTGAAAGCTTAGTATTTTAATTAGTG 

3 63 41 ATGGATACCAAATAGTAATTATTAATAACTCAGTAGACTTATAATAAGTAGC^ 

364 33 GTTTGTTGCCCGTAAAGACCAAAGAAGAGACATGGATTTTGGAAAGTC 

3 652 5 ATAAAGTACTCTGCACAGAGTGTGAATCCAGCCATATTCATTACTGTATGATGTGCTrTTCTATGCA 

36617 TAGTGAGCATGTAAGGATGGCTCTTGAGGTCACAGTTCTTTCAGTGAG^ 

3 6709 AGAAGGTGCTGAAGAATGCCAAGCAGGCATGC CAAAGAC T AGGACAGT ATAGAATGC CATTTGCTTGGGCAGCAAGGTAAGGAACACCTTTT 
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ATACCTTTTAAATCGATATAGATAGGTGCATGGATGGGTCAATAGGCCTAT^ 

AACTGAC^WVTACATAAGCCCAGATTTTGAAAAA^^ 

TGGCAGGGTGCACGGTAGTGTTAGCTGCAGAGCT 

CTGGGGTGATCTTTATCTCTTGACTCTACTTGTTAAGCATTTGACTTGTGTATATTGTTTC 
CTATGTAAATTGATTTAGTTGTCCTCATCC^ 
TAATTAGACTTCTCATAGAAACTAGACTTAAATAATG 
GACATTGTTTAAGGATGCATCTGGAAATCTTG^ 

ACATGCTCAAGTTACTTGCAGACTTTCGGAAGTGAGTTTCAAGGTCTT 
AACAGGGGAGCAGTCACTTAGGTTGCTCGATTTAGACATCAG^^ 
GATGGGGTGAAAATCAGAAGAATAACCAGTTAATTTGAATTCTGTAGAGGATGTT^ 
AGCCTATGGTGGGGAGGAAACAGTTGAGGAACCTTGTCAAGAGGTGAGAAAGGACT 

GACAAATTCAAGTCGTGCCAAAGAGATACGATGACTGGGTCTTGGCTTGGGGTGGTAATAGTCTAAGATAAATA^ 
TTGGAAGTTTCTGGCACCAGTTGTTGCTTAGCTTGTGGCAGCATTGCTCCACTCTGCCT 
TGTCTAAATTCCCTTCTTAGAACACTAGTGATAAAGCATCAG^ 
TTTCCAAATAGGTCACATTTATAGGTACAA&GGGTTG 

TGCTGCTACATCCTAAAGAGTTCACTCTGAA.CCTTTGAAACTGATTTTCTTTGCTA 

GGGAATGTGAAAGAGCTGAGGGCGCTAGAAGATGTGAAGTGAAAAG^TAGCTGAGAGCCAAA 

TGTTTTTTTTTTTTTTTGTGCATATCAAAATAGCA^ 

TTACCTTTTTCTTATCTGTCTTCAGTAGTAGGAATAGAAACGATATGAGTCATAGAA^ 

Gyw^GGATTCTCAAGGTCTCTTTCAGTTATGTGATTATACAGTTTTTGACTGTC 
AACCTCAGTTTTGCTAACATGCAGCTAAGGTTGGCCTGTTCAGCAAAGCAGTGTGCATGC 
GCTCACGTAGGGAATTGGAGAAGGGGGAGAGGAGGATACTGGTGAAGGAXGAGGCCTGCTGGGTTAGCCTTC 
GGTGCCCCAAATTCCCAGTCACTATCTGACAGTTTTATGAC^^ 

CTGTTATTCATGAAAGACTAAATAAAAGAATAGTACCCTATTTTTACTTTTAAATCATAG^ 
CATTTTAGAAATTTTGAGAAATACAGAAGAATAAAAGGATGAAAAAAGGTT 
TTTTTTTTTTTTTTGAGATGGAGTCTCACTCTGTCC^ 
TTGAAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTATAGG 

TGGGGTTTCACCATATTGGCCAGGCTGATCTCAAACTGCTGACCITGTGATCTGCCCGCCrCAGCCTCC 

GAGCCACCGTGCCCAGCCTAGGGGGGAACATTTTTTTTtACGTTTTATTCC 

ACTAAATATGTAATATTTCCCTGCACAACTCAAGTAT^ 

TTATARTAGTCCATTGAGATAGACCATAGATTATTTAACTCTTCCCCCATTTTTTGAC 
AGCTGGGACTCCTGAGTAGCTGAGTAGCGCATGCTGCCAC^ 

TACTAAATATCTCACCATCTTGCCCAGGCTGGTCTCAAACTCCTGACCTCAGGCAATCTGCCCGCCT 

AGGTGTGAGCCACCATGCCCAGCCATTTTTTGACTTTTAATGTGTTTCTGA^ 

AAAAAATCTTCTCTATTGGTAGTGGGTAATATATTATCATAC^ 

ATTTCATTCTTACTAATTTTTTCAAAAACCAGTCACCTTTAG^ 

TAACCATGAGGTGGGTCTGCGTGTACTTAGGAAAGTGAATAC^ 

AAGAAAATATATATTTCTTTTGTATGTAAATGAAGAAATGGATAAGCAAGTAGCTATCTAGATGGAA^ 

GGATATATGCCAAATAATCATGGTTATCTCTGAGGGATGGGTTGATGGGTGATATTTCACCTTCCACTTTAT 

TGAACTTTTAAAAAACTAACACTTTTATATTC 

TTACTTCCTAGGCTGGTTAGCTTGAGACTTCCCCACAGTGGCGGCTCCTCAGGGGCAGCCCCAGTGCATGGTCCTG 
GGGAGTGGGGCITCACATGGTCACTAATTTGAAAGTGATGGGAG^ 

AGAAACCTGAAAACAGGGATCCGATGGTGACAGCATAGAAGACA 

TTGGTGCCCACTCCCCGTATTCTTCACAACAGAGTTAGGGGACGTGGAGGATTCCT 

CTTTCCTCTGTATATTTTACAGGAAATAATCT 

TCTCAAAATGTTATTTAATAATGCATGAAAAAAATTTCTTCACGCTCT 
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414 01 CTTTTTCCCTTGCAATAATTACCCAGGGATATGTTCC^ 

4 14 93 AGAAGAGAGACTTGGTACTTTGTATTTGATTTGTTCATGGTGGTATCTCATGGA 

415 85 TGCGAAGACATGAAGGCAAAGTATAAAAATAGAACGTTTTCTTTAAAACGTAGACCTTTT^ 
41677 ATACTACTAAAGTTTTTGCGTATGCAGCTTAATGTGTCT 

417 69 GATTTCGCTCTGGTTACACTGCACTCAAGCCAAGTAGGGCTGCTTGACTTTCTCTAAA 

418 61 TCTGCCCAAArTGTGTCCCCTTCCTGCCTCATCAAGCA^ 

4195 3 TCTCAGGGAACTCTACTCATCTTTCAAAGCCCAGTCTGTGGCTCACTTCTGTGCTGGGAGTCCTGGAGGCGG 

4 204 5 AGCCGCCTCCTCTTTTTAAGGGTGCATAATAA^ 

4 2137 TCCCACCATGAATACTGATCTCATCCCGTGTTCCCTCTCGATAGA^ 

4 222 9 GTGATTGCCCTCATATGCCGTAAGTAGCTTACAGTGTCTACTGGAC 

4 2321 ATGGTCCTGACTACATACATTGCAGCATATCCTA^ 

4 2 413 TGAAACAAAAATATCTCAAATTTCTTTCAATCAT^^ 

4 25 05 AACACAGTTAGAAGATTAAACTCACCACCAATAGCAGTCCAAAC^ 

4 25 97 AGGAAGGGAAGACCGGCTGCAGAAACACTTAGATAAGGACC^ 

4 2 68 9 CCTCTGTGGGAATTGTTGAGTCAGCAGAAATGGATGGGCAGT 

4 2781 ACACAGAGACAGGAGCAGTTCCCAGAGGCCAGGCATCCTGCAAGT^ 

4 2873 TCATGTTCAAGGATTATTTTATAAATTTTGCATAGAATA^ 

4 2 965 AAAGAATTCAGTTTGTGTAACACGCACAa^ACCACCACTTTGGAAGTGCATAA 

4 3057 TCTAATAAACCAGATTCAACATAAACCATAAACTTTTGAAATGGGTTTG^ 

4 314 9 ACTTTTAAAAGCCCATGTGCTACATAATATGGAACTAAACTCAGAAATGTGCTTGGAAACACAT 

4 32 41 CTAC^GTAAAATCTCTCAGCAGAGGGAGGAAATAGAATAAGAA^ 

4 3333 CCAGTGAAGATCATTTAATTAAAATATGTTGCTTAGAAACGTATTTTAATT 

4 3 4 25 GAGTGATCTTAAAAATGGTGATGAAGATGCCTGTTCATTCATA 

4 3517 TTATATTTAAAAGTATAATTTGTAATAAA 
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hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



M FPKE D I S I SV I GRQRRTVQ 20 

MTHLNSLDVQLAQELG 16 

MAERRAFAQKISRTVAAEVRKQISGQYSGSPQLLKNLNIVG 4 1 

MLLFPYDDFQTAILRRQGRYICS 2 3 

. MAASERRAFAHKINRTVAAEVRKQVSRERSGSPHSSRRCSSSL 4 3 

MSFRGKVFKREPSEFWKKRRTVRRVIQEEFHRFSSQEKPRLLEPLDYETVIEELEKTYRN 60 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



STVPEDAEKRAQSLFVKECIKTYSTDWHWNYK 53 

DFT 19 

N ISHHTTVPLTEAVDPVDLEDYLITHPLAVDSGPLRDLIEFP 8 3 

TVPAKAEEEAQSLFVTECIKTYNSDWHLVNYK 55 

G VPLTEWEPLDFEDVLLSRPPDAEPGPLRDLVEFP 79 

DPLQDLLFFPSDDFSAATVSWDIRTLYSTVPEDAEHKAENLLVKEACKFYSSQWHVVNYK 120 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



YEDFSGDFRMLPCKSLRPEKIPNHVFEIDEDCEKDED- 

DDDLDWFTPKECRTLQP-SLPEEGVELDPHVR 

PDDIEWYSPRDCRTLVS-AVPEE-SEMDPHVR 

YEDYSGEFRQLPNKWKLDKLPVHVYEVDEEVDKDED- 



-SSSLCSQKGGVIKQG 105 

DCVQTYIREWLI 63 

DCIRSYTEDWAI 12 6 

-AASLGSQKGGITKHG 107 



ADDLELLLQPRECRTTEP-GIPKD-EKLDAQVR AAVEMYIEDWVI 122 

YEQYSGDIRQLPRAEYKPEKLPSHSFEIDHEDADKDEDTTSHSSSKGGGGAGGTGVFKSG 180 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



WLHKANVNSTIT — VTKKVFKRRYFYLTQLPDGSYILNSYKDEKNSKESK-GCIYLDACI 162 
VNRKNQGSPEIC--GFKKTGSRKDFHKT-LPKQTFESETLECSEPAAQA — GPRHLNVLC 118 
VIRKYHKLGTGF — NPNTLDKQKERQKG-LPKQVFESDEAPDGNSYQDDQDDLKRRSMS I 183 
WLYKGNMNSAIS — VTMRS FKRRFFHLIQLGDGS YNLNFYKDEKISKEPK-GS I FLDSCM 164 
VHRRYQYLSAAY — SPVTTDTQRERQKG- LPRQVFEQDASGDERSGPEDSNDSRRGSGSP 179 
WLYKGNFNSTVNNTVTVRS FKKRYFQLTQLPDNSYIMNFYKDEKISKEPK-GCIFLDSCT 239 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



DW<^PKMRRHAFELKMLDKYSHYLAAETEQEMEEWLITLKKIIQINTDSLVQEKKETVE 222 

DVSGKGPVTACDFDLRS LQPDKRLENLLQQVSAEDFEKQNEEARRTN RQAE 169 

DDTPRGSWACSI FDLKNSLPDALLPNLLDRTPNEEIDRQNDDQRKSN RHKE 2 34 

GWQhmKVRRFAFELKMQDKSSYLLAADSEVEMEEWITILNKILQLN FEAAMQEK 219 

EDTPRSSGASSIFDLRNLAADSLLPSLLERAAPEDVDRRNETLRRQH RPPA 230 

GVVQNNRLRKYAFELKMNDLTYFVLAAETESDMDEW I HTLNRI LQI S PEGPLQGRRSTEL 2 99 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



TAQDDETSS QGKAENIMAS LERSMHPELMKYGRETEQLNKLSRGDGRQNLFS FDSE 27 8 

LFALYPSVD EEDAVE I RPVPECPKEHLG N RILVKLLTLKFEIE 212 

LFALHPSPD EEEPIERLSVPDIPKEHFG QRLLVKCLSLKFEIE 277 

RNGDSHEDD EQSKLEGSGSGLDSYLPELAKSAREAEIK LKSESRVKLFYLDPD 272 

LLTLYPAPD EDEAVERCSRPEPPREHFG QRILVKCLSLKFEIE 27 3 

TDLGLDSLDNSVTCECTPEETDSSENNLHADFAKYLTETEDTVKTTRMMERLNLFSLDPD 359 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



VQRLDFS GIEPDIKP-FEEKCNKRFLVNCHDLTFNILGQIGDNAKGPPTNVEPFFI 333 

IEPLFAS IALYDVKERKKISENFHCDLNSDQFKGFLRAHTPSVAASSQARSAVFSV 2 68 

IEPIFAS -LALYDVKEKKKISENFYFDLNSEQMKGLLRPHVPPAAITTLARSAIFSI 33 3 

AQKLDFS SAEPEVKS - FEEKFGKRI LVKCNDLS FNLQCCVAENEEGPTTNVEPFFV 32 7 

IEPIFGI LAL YDVREKKK I SENFYFDLNS DSMKGL LRAHGTH P AI STLARSAI FS V 32 9 

I DTLKLQKKDLLE PES V I KP FEEKAAKR IM I ICKALNSNLQGCVTENENDPITNIEPFFV 419 
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hCLASP4 NIJU.FDVIOJNCKISADFHVDLNPPS'/REMLWGSSTQLASDGSP KGSSPESYIHGIAE 390 

hCLASP5 TYPSSDIYLWKIEKVLQQGD IGDCAEPYTVIKESDG GKSKE-KIEKLKL 3L7 

hCLASP3 TYPSQDVFLVIKLEKVLQQGD IGECAEPYMI FKEADA TKNKE-KLEKLKS 382 

hCLASP2 TLS LFDI KYNRKI SADFHVDLNH FSVRQMLATTS PALMNGS GQS PS VLKG I LHE 381 

hCLASP7 TYPS PDI FLVIKLEKVLQQGD ISECCEPYMVLKEVDT AKNKE-KLEKLRL 378 

hCLASPl SVALYDLRDSRKISADFHVDLNHAAVRQMLLGASVALENGNIDTITPRQSEEPHIKGLPE 479 



hCLASP4 SQLRY IQQGI FSVTNPHPE I FLVARI EKVLQGNI THCAEPY IKNS DPVKTAQPCVHRTAKQ 450 

hCLASP5 QAESFCQR LGKYRMPFAWAPISLSSFFNVSTLEREVTDVDSWGRSPVGERRTLA 372 

hCLASP3 QADQFCQR LGKYRMPFAWTAIHLMNIVSSAGSLERDSTEVEISTGERKGSWSERR 4 37 

hCLASP2 AAMQ YPKQG I FS VTC PH PD I FL VAR I EKVLQGS I T HCAE P YMKS S DS S KVAQKVLKNAKQ 441 

hCLASP7 AAEQFCTR LGRYRMPFAWTAVHLANIVSSAGQLDRDSD SEGERRPAWTDRR 429 

hCLASPl EWLKFPKQAVFSVSNPHSEIVLVAKIEKVLMGNI ASGAEPYIKNPDSNKYAQKILKSNRQ 539 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



VC S RLGQYRMP FAWAARP I FKDTQGSLDLDGP 

QSRRLSERALSLEENGVGSNFKTS 

NSSIVGRRSLERTTSGDDACNLTSFR- 
ACQRLGQYRMPFAWAARTLFKDASGNLDKNAF 
RRGPQ — DRASSGDDACSFSGFR-: 



FCSKLGKYRRAFAWAVRSVFKDNQGNVDRDSP FS PLFRQESSKISTEDLVKLVSDYRRAD 



FSPLYKQDSSKLSSEDILKLLSEYKKPE 
FFKQEGDRLS DEDL FKFLADYKRSS 
FFKQEGDRLSDEDLYKFLADKRRPS 
FS A I YRQDSNKLSNDDMLKLLADFRKPE 
FFKQEAERLSDEDLFKFXADMRRPS 



510 
427 
496 
501 
483 
599 



--KTKLQIIPGQLNITVECVPVDLSNCITSSYVPLKPFE-KNCQNITVEVEEFVPEMTKY 567 
S LQRRVKS I PGLLRLE I STAPE I INCCLT PEMLPVKPFP-ENRTRPHKEI LE FP — TREV 484 
SVLRRLRPITAQLKIDISPAPENPHYCLTPELLQVKLYP-DSRVRPTREILEFP— ARDV 553 
K-MAKLPVILGNLDITIDNVSSDFPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKH 560 
SLLRRLRPVTAQLKIDISPAPENPHFCLSPELLHIKPYP-DPRGRPTKEILEFP— AREV 54 0 
R- I SKMQTI PGSLDIAVDNVPLEHPNCVTSS FI PVKPFNMMAQTEPTVEVEEFVYDSTKY 658 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



C^PFTIYKNHLYVYPLQLKYDSQKTFAKARNIAVCVEFRDSDESDASALKCIYGKFOlGSV 62 7 
YV PHTVYRNLLYVYPQRLNFVN — KLASARNITIKIQFMCG-EDASNAMPVIFGKS 5GPE 541 
YV PNTTYRNLLYI YPQSLNFAN — RQGSARNITVKVQFMYG-EDPSNAMPVI FGKS SCSE 610 
TC PYT1 YTNHLYVYPKYLKYDSQKSFAKARN IAICIEFKDSDEEDSQPLKCIYGRP oGPV 620 
YA PHTSYRNLLYVYPHSLNFSS — RQGSVRNLAVRVQYMTG-EDPSQALPVI FGKS 3CSE 597 
CP P YRVYKNQ I Y I Y PKHLKY DSQKC FNKARN I TVC I E FKN S DEES AKPLKC I YGKF SGPL 718 



FT T ^ A YAWS HHNQN PE FYDEIKIELPL HLHQKHHLL FT FYHVSCJ 
FLQI VYTAVTYHNKSPDFYEEVKIKLPAKLTVNHHLLFTFYHISCC Q- 
FSK3 AYTAVVYHNRSPDFHEEIKVKLPATLTDHHHLLFTFYHVSCQQ- 
FTRJ AFAAVLHHHQNPEFYDEIKI 

FTRi AFT P WYHNKS PE FYEE FKLHLPACVTENHHLL FT FYHVSCC 
FT S; AYTAVLHHSQNPDFS DEVK I ELPTQLHEKHH I L FS FYHVTCE 



INTKGTTKKQD1 VE 
KQGAi V 



IELPTQLHEKHHLLLTFFHVSCI NSSKGSTKKRm VE 



687 
VE 595 
664 
680 
651 

INAKANAKKKEALE 778 



TLLGYSWLPILLNERLQTGSYCLPVfl LEKLPPNYSKHSAEKVPLQNPPIK 



TPVGFAfrTV'PLLKDGRlITFEQQLPVS^LPPGYL^NDAESP^QCNVlfiKWVDGAKPLLK 74 7 

IKWAEGHKGVFN 655 
LPGpOCWVDNHKGVFN 721 
74 0 

IPLLQHGRLRTGPFCLPVsjVDQPPPSYSVLTPDVA LPGkRWVDGHKGVFS 708 

C IKWVDGGKPLFK 838 



TPVGYTWIPMLQNGRLKTGQFCLPVSJLEKPPQAYSVLSPEVP-- 
TQVGYSWLPLLKDGRWTSEQHIP 
TPVGFTWIP 

TS VG YAWLPLMKHDQ I ASQE YN I P IAhTS L P PNYLS FQDSASGKHGGS dl 



IPVS ANLPSGHLGYQELGMGRHYGPE IKWVDGGKPLLK 
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hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



HCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



FKSHLEST I YTQDLHVHKFFHHCQLIQS - ■ 
IEVQAVSSVHTQDNHLEKFFTLCHSLES^ 
VEWAVSSIHTQDPYLDKFFALVNALDEH- 
ISTHLVSTVYTQDQHLHNFFQYCQKTES -- 
VELTAVSSVHPQDPYLDKFFTLVHVLEE 3- 
VSTFWSTVNTQDPHVNAFFQECQKREKD- 



GSKEVPGELIKYLKCLHAM 794 

TFPIRVLDQKISEMALEHELKLSI ICLNSS 715 
L FP VR I G DMR IMENNLENE LKS S I S ALNS S 780 

GAQALGNELVKYLKS LHAM 78 7 

AFPFRLKDTVXSEGNVEQELRASLAALRLA 7 67 
MSQSPTSNFIRSCKNLLNVE 887 



QLjEPWRFLHLLLDKL: 

MIAFLPTILNQLEJr 
S Pfe PL VAFS HHVLDKLV RLV 
KI^AIMSFLPI ILNQLF K- 



EIQVMIQFLPVILMQLER VLTNMTH EDDVP 824 

RljEPLVLFLHLVLDKLEQLSVQPMVIAGQTANFSQFAFESWAIANSLHNSKDLSKDQHG 77 5 
IJLLV I RPPVIAGQ I VNLGQAS FEAMAS 1 1 NRLHKNLEGNHDQHG 840 

VLT-RAT QEEVA 816 

IRPPIISGQIVNLGRGAFEAMAHWSLVHRSLEAAQDARG 82 7 
VLVQNE EDEIT 916 



INCTMV-LLHIVSKCHEEGLDS YLRSFIKYS FRPEKP 8 60 

RNCLLAS YVHYVFRLPEVQRDVPKSGAPTALLDPRS YHT YGRTSAAAVSSKLLQARVMSS 83 5 
RNSLLASYIHYVFRLPNTYPNSSSPG-PGGLGGSVHYATMARSAVRPASLNLNRSRSLSN 899 

VNVTRV- 1 IHWAQCHEEGLES — HLRSYVKYA YKAEPY 8S2 

HCPQLAAYVHYAFRLPGTEPSLPDGAPP — — VTVQAATLARGSGRPASLYLARSKS ISS 883 
TTVTRV-LPDIVAKCHEEQLDH SVQSYIKFV FKTRAC 952 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



SAPQAQLIH ETLATTMIAILKQS 883 

SNPDLAGTHSAADEEVKNIMSSKIADRNCSRMSYYCSGSSDAPSSPA 882 

SNPDISGTPTSPDDEVRS I IGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAESTQAMDRSC 959 

VASEYKTVH EELTKSMTTILKPS- 875 

SNPDLAVAPGSVDDEVSRI LASKLLHEELA- LQ 915 

KE RPVH EDLAKNVTGLLKSN 972 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



ADFLSINKLLKYS WFFFEI IAKSM 907 

APRPASKKHFHEELALQ MVVSTGMVKSM 910 

hmMSSHTETSSFLQTLTGRLPTKKLFHEELALQWVVCSGSVRESALQQAWFFFELMVKSM 1019 

ADFLTSNKLLRYS WFFFDVLIKSM 899 

WWSSSAVREAILQHA WFFFQLMVKSM 942 

DSPTVKHVLKHS WFFFAI ILKSM 995 

Cadherin Cleavage 



ATYLLEENKIKLE|RGQRt"PETYHHVLHSLLLAI IPHVTIRYAEIPDE SRNVN^jSLAS 964~ 

AQHVHNMDKRDSf RRTR rSDRFMDDITTIVNWT SEIAALLVKPQKENEQAEKMN1 SLAF 970 

VHHLYFNDKLEAIRKSRrPERFMDDIAALVSTIASDIVSRFQKDTEM VERLN1SLAF 1076 

AQHLIENSKVKL1 RNQR rPASYHHAAETVVNMLMPHITQKFGDNPEA SKNANT SLAV 956 

ALHLLLGQRLDTE RKLR fPGR FLDDI TALVGSVGLEVI TRVHKDVEL AEHLNJ SLAF 999 

AQHLIDTNKIQLE RPQR TPES YQNELDNLVMVLSDHVIWKYKDALEE TRRATI- SVAR 1052 



FLKRCLTLMDRGFI 
FLYDLLSLMDRG 
FLNDLLSVMDRG 
F I KRC FT FMDRG 



FpJLINDYISGFSPKDP 

irhycsqlsaklsnl- 
ikscykqvssklyslp: 
innyiscfapgdp- 



-ETLI 



FLSDLLSLVDRGFVF 5 LVTIAH Y KQ V AT RLQSSPNP/ ALLTLRMEFTRI 
FLKRC FTFMDRGCVE <MVNNYISMFSSGDL- 



^jVLAEYKFEFLQTICNHEHYlPLNL 1019 
ISMRLEFLRILCSHEHYLNLNL 1027 
VS LRLDFLRI ICSHEHYVTLNL 1136 
FEYKFEFLRWCNHEHYIPLNL 1011 
LCSHEHYVTLNL 1059 
HTLCQYKFDFLQEVCQHEHFIPLCL 1107 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



Cadherin EC m otif 

PMAFAKPKLQR VQDS - -NLE YSLSDEYCKHHFLVGi[lLr3tS I 10 60 

FFMNADTAPTSP--CPSISSQNSSSCSSFQDQKIASMFDLTSEYRQQHFLTGI LFTELAA 108 5 

PCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELSVPFRQQHYLAGI VLTE LAV 1196 

PMPFGKGRIQR YQDL--QLDYSLTDEFCRNHFLVGI LLREVGT 1052 

PCCPLSPPAS PSPSVSSTTSQSSTFSSQAPDPKVTSMFELSGPFRQQHFLAGI LLTE LAL 1119 

PIRSANIPDPLTP SES TQELHASDMPEYSVTNEFCRKHFLIG1 LLREVGF 1157 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



ALQDN YE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQLYLPFVGLLLENI 3RL 1116 

ALDAEGEGISF VQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAALYLPLVGI ILDAL ?-- 114 3 

ILDPDAEGLFG LHKKVINTMVHNLLSSHDSDPRYSDPQIKARVAMLYLPLIGI IMETV ?-- 1254 

ALQEFR £ VRLIAISVLKNLLIKHSFDDRYASRSHQARIATLYLPLFGLLIENV2RI 1108 

ALEPEAEGAFI LHKKAISAVHSLLCGHDTDPRYAEATVKARVAELYLPLLS IARDTL ?-- 1177 

ALQEDQ E VRHLALAVLKNLMAKHSFDDRYREPRKQAQIASLYMPLYGMLLDNM ?RI 1213 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



AGRDT LYSCA AMPN-S ASRDEFPCGFTSPANRGSLSTDKDTAYGS 1160 

QL CDFTVADTRRYRTSGSD 1162 

QLY DFTETHNQRGRP I C I ATDD - - 127 6 

NVRDVSP.FPVNAGMTVKDESLALPA-VNPLVTPQKGSTLDNSLHKDLLGAISGIASPYTT 1167 

RLH — DFAEGPGQRSRLASMLDSDTE 1201 

YLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFSKDVLNSIAAFSSIAIS 127 3 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



FQ-NGHGIKREDSRGSLI PEGATGFPDQGNTGEN TRQSSTRSSVSQYNRLDQYE 1213 

EEQEGAGAINQNVALAIAGNNFNLKT SG I VLS S LP YKQYNMLNADT 1208 

YESESGSMISQTVAMAIAGTSVPQLTR PGSFLLTSTSGRQHTTFSAES 1324 

STPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSE 122 7 

GEGDI AGT INPSVAMAI AGGPLAPGSR AS ISQGPPTASRAGCALSAES 12 4 9 

TVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPLALIGSTLRFDRLDQAE 1327 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



I RSLLMCYLYIVKMISEDTLLTYWNKVSPQELINILILLEVCLFHFRYMGKRNIAP l /HDA 1273 
1RNLMICFLWIMKNADQSLIRKWIADLPSTQLNRILDLLFICVLCFEYKGKQSSDF7STQ 1268 
i RSLLICLLWVLKNADETVLQKWFTDLSVLQLNRLLDLLYLCVSCFEYKGKKVFER WSL 1384 
] KSLLMCFLYILKSMSDDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIAB MQEG 1287 
SRTLLACVLWVLKNTEPALLQRWATDLTLPQLGRLLDLLYLCLAAFEYKGKKAFEPINSL 1309 
1 RSLLMCFLHIMKTISYETLIAYWQRAPS PEVSDFFSILDVCLQNFRYLGKRNI IR KIAA 1387 



WLSKHFGIDR 

VLQKS RDVKAR- 
T FKKS KDMRAK- 
LGPIVHDRKS — 
TFKKSLDMKAR- 



— KSQTMPALRNRSGVMQARLQHLSSLESS 1311 

-LEEALLRGEGARGEMMRRRAPGNDRFPGLNEN 1311 

-LEEAILGSIGARQEMVRRSRGQLERSPSGSAFGSQ 14 30 

QTLPVSRNRTGMMHARLQQLGSLDNS--- 1323 

- LE E A I LGT I GARQEMVRRS RERS P FGNPEN 13 50 



AFKFVQSTQHNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHKQHRSQTLPI IRGKN 14 4 2 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



-FTLNHSSTTTELf DI 



-LRWKKEQTHWRQANEKLDKTK? ELDQEAL 



ENLRWRKDMTHWRQNTEKLDKSR4E 
LTFNHSYGHS 



FHQALLEGNTATEVS LTVLDT I S F 
ISGNLATEAHLIIL 
I EHEAL I DGNLATEANLI I LDTLE I 



FF TQCFKTQLL 1359 
LDMQENI IQASS-ALD 1368 
IVJ/QTVS-VTE 1489 
1371 

— VRWRKSVTHWKQTSDRVDKTKliEMEHEALVEGNIA.TEASLVVLDTLEIIpQTVM-LSE 1407 
--ALSNPKLLQMLDNTMTSNSNE: DIVHHVDTEANIATEGCLTILDLVSLF rQTHQRQLQ 1500 



D; DVLHQSLLEANIATEVCLTALDTLSLF rLAFKNQLL 
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WDGHNTLMKKVFDIHLAFLKNGQSEVSLKHVFASLPAFISKFPSAFFKGRVNMCAAFCY 14 19 

CKDS LLGGVLRVLVNSLNCDQSTTYLTHCFATLRALIAKFGDLLFEEEVEQCFDLCH 1425 

SKES ILGGVLKVLLHSMACNQSAVYLQHCFATQRALVSKFPELLFEEETEQCADLCL 154 6 

ADHGHNP LMKKVFDVY LC FLQKHQS ETALKNVFTALRS L I YKFPS T FYEGRADMCAALC Y 14 31 
ARES---VLGAVLKWLYSLGSAQSALFLQHGLATQRALVSKFPELLFEEDTELCADLCL 14 64 
QCDCQNSLxMKRGFDTYMLFFQVNQSATALKHVFASLRLFVCKFPSAFFQGPADLCGSFCY 1560 
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EVLKCCTSKISSTRNEASALLYLLMRNNFEYTKRKTFLRTHLQI I IAVSQLIADVALSGG 1479 

QVLHHC S S SM DVTRSQACAT L Y L LMR- - FS FGATSN FARVKMQVTMS LAS LVGRAPD FNE 14 8 3 

RLLRHCSSSIGTIRSHPSASLYLLMR — QNFEIGNNFARVKMQVPMSLSSLVGTSQNFNE 1604 

EILKCCNSKLSS IRTEASQLLYFLMRNNFDYTGKKS FVRTHLQVI ISVSQLIADWGIGE 14 91 

RLLRHCGSRISTIRTHASASLYLLMR— QNFEIGHNFARVKMQVTMSLSSLVGTTQNFSE 1522 

EVLKCCWRSRSTQTEASALLYLFMRKNFEFNKQKS I VRSHLQLIKAVSQLIADAG- IGG 1619 
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S R FQE S L F 1 1 NN FAN S DR PMKATAFPAE VKDL T KR I RTVLMATAQMKEHEKD PEIML I DLQ 1539 

EHLRRSLRTILAYSEEDTAMQMTPFPTQVEELLCNLNSILYDTVKMREFQEDPEMLMDLM 154 3 

EFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMILSDTVKMKEHQEDPEMLIDLM 1664 

TRFQQSLSII^CANSDRLIKHTSFSSDVKDLTKRIRTVLMATAQMKEHENDPEMLVDLQ 1551 

EHLRRSLKTILTYAEEDMGLRDSTFAEQVQDLMFNLHMILTDTVKMKEHQEDPEMLIDLM 158 2 

SRFQHSLAITNNFANGDKQMKNSNFPAEVKDLTKRIRTVLMATAQMKEHEKDPEMLVDLQ 1679 
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transmembrane 



FS EAAMC YVHVAALVAE FI HRKK- 

I SMLEDH- 



YS LAKS Y AS T PE LRKT W L DSMAK I HVKNGI 

YRIAKSYQASPDLRLTWLQNMAEKHTKKKC YTEAAMCLVHAAALVAEY: 

YRIAKGYQTSPE-RLTWLQNMAGKHSERSI* HAEAAQCLVHSAALVAEYlisMLEDR 

YS LAKS YASTPELRKTWLDSMAR I HVKNGI LSEAAMCYVHVTALVAEYI TRKG 

YRIARGYQGSPDLRLTWLQNMAGKHAELGh HAEAAQCMVHAAALVAEYl ALLEDQ 

YS LANS YAS T PELRRTW LESMAK I HARNGI 



1592 
1598 
1718 
16.04 
1637 

LSEAAMCYIHIAALIAEYlJKRKGYWKVEKI 1739 



LFPNGCSAFKKITPNIDEEGAMKEDAGMMD 1622 

SYLPVGSVS FQNISSNVLEESWSEDTLSPDEDGV 1633 

: KYLPVGCVT FQN I SSNVLEESAVS DDWS PDEEG I 17 53 

VFRQGCTAFRVITPNIDEEASMMEDVGMQD 1634 

RHL PVGCVS FQN I SSNVLEESAI SDDI LS PDEEG F 167 2 

CTASLLSEDTHPCDSNSLLTTPSGGSMFSMGWPAFLS ITPNIKEEGAAKEDSGMHD 1795 
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IT AM 



I I S E I S KL I VP I YEKRRE FEKLTQV YRTI HG 



VHYSEEVLLELLEQCVDGLWKAERYEi: 

CAGQY FTESGLVGLLEQAAEL FS TGGLYETVNEVYKLV I P I LEAHRE FRKLTLtIhSKiJqR 
CSGKY FTESGLVGLLEQAAAS FSMAGMYEAVNEVYKVLI P I HEANRDAKKLST I HGKI QE 

VHFNEDVLMELLEQCADGLWKAERYELIADIYKLIIPIYEKRR 

CSGKMFTELGLVGLLEQAAGYFTMGGLYEAVNEAnfKNLIPILEAHRDYKKLAAVHGKIQE 
— TPYNENILVEQLYMCGEFLWKSERYELIADVNKPI IAVFEKQRDFKKLSDL YYTJIHR 



1679 
1693 
1813 
1677 
1732 
1852 



DOCK motif 



ITAM DOCK moti: 

YTKlLEVMHTKKRLI^ljFFR^jA^GQdFFEEEltaKE^ 

FDS ]j VNKDH- - KRM FGH Y FRMG E1FG - £ K FG DLL E QE HV YKE E A I TKL PE I S HRLEAI -YG 
FSK 3 VHQSTGWERM FGT1Y FR\iG FtYG- 1 KFGDLE EQE E ^ YKE E AI TKLAE ISHRLEGI YG 

1 _j 1 — | EFFEDEEGKEYl YKEPKLTPLSEISQRLLKI YS 

FTKljMHQSSGWERVFGTfYFRVGF YG-^HFGDLE EQEe{v YKEEjsiTKLAEISHRLEEI YT 



YLIO|AEWNSEKRLFGI^YR^e|ygQ^FFEEE^ 



ITAM 

1739 
1750 
1872 
1710 
1791 
1912 
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IT AM 

"eI^FGTENVKI IQDSDKVNAKELDPf 



QC FGAEFVEVIKDSTPVDKTKLDPr KAYIQ] T FVE: 



EP FGEDWEVTKDSNPVDKCKLDP^fKAY 
DKFGSENVKMI 

EBFGDDWEIIKDSYPVDKSKLDSdKAYIQ 
DK}FGADNVKI IQDSNKVNPKDLDP. 1 YAYIQ\ 




YFDDKELTERKTEFERNHNISRFV 1799 
YFDEYEMKDRVTYFEKNFNLRRFM 1810 
YFDTYEMKDRITYFDKNYNLRRFM 1932 
FFDEKELQERKTEFERSHNIRRFM 1770 
YFDTYELKDRVTYFDRNYGLRTFL 18 51 
FFEEECEIEDRKTDFEMHHNINRFV 1972 



DOCK motif 



feapytlsgkkqgc i eeqckrrt i lttsns fl 
yttpft legrprgelheqyrrntvlttmhafejy 
yctpftldgrahgelheqfkrktilttshaff|y 
femp ftqtgkrqggveeqckrrt i ltai hc fj 
fctpftpdgrahgelpeqhkrktllstdhafejyikt|rrrvchree^ 
fetpftlsgkkhggvaeqckrrtilttshlf] 



Coiled-coil 



IPINCEQQn LKP I DGATDE I KD 
I KllR I S V I QKEE F\ LT P I E VA I EDMKK 
iktJrvnvthkeei] LTPIEVAIEDMQK 
I pvmyqhhti|lnpievaidemsk 

LTPVEVAIEDMQK 
IQVISQSSTdLNPIEVAIDEMSR 



ktaelqklcsstdvdmiqlqlklqc va^svqvnagplayaraflndsqaskyppkkvselk 
ktlqlavainqeppdakmlqmvloc svgatvnqgplevaqvflae ipadpklyrhhnklr 
ktqelafathqdpadpkmlqmvlqc svgttvnqgplevaqvflseipsdpklfrhhnklr 
kvaelrqlcssaevdmiklqlklqc svsvqvnagplayaraflddtntkrypdnkvkllk 
ktrelafateqdppdakmlqmvlqc svgptvnqgplevaqvflae I pedpklfrhhnklr 
kvselnqlctmeevdmislqlklqc svsvkvnagpmayarafleetnakkypdnqvkllk 



Coi led-coil 



dmfrkfiqacs: alelnerlikedqveyheglksnfrdmvkelsdiiheqilqedtmhsp 
lcfkefimrcglaveknkrlitadqreyqqelkknynklkenlrpn ierkipelykpifr 
lcfkdftkrcehalrknksligpvqkeyqrelgklssp 

EVFRQFVEACG('AUVVNERLIKZIX3LEYQEEMKANYREKAKELSEI^HEQICPLEEKTS- 
LCFKDFCKKCEI iALRKNKALIGPDQKEYHRELERNYCRLREALQPL ltqrlpqlmaptp- 
E I FRQFADACG< 'ALDVNERLIKEDQLEYQEELRSHYKDMLSELSTV yiNEQITGRDDLSKR 



1859 
1870 
1992 
1830 
1911 
2032 



1919 

1930 
2052 
1890 
1971 

2092 



1979 
1990 
2090 
1949 
2030 
2152 



PD Z liqa nd 
WMSNTLHVFCAISGTSSDRGYGSPF{Y££3-- 2008 
VESQKRDS FHRSSFRKCETQLSQGS 2015 

VLPNSLHI FNAISGTPTSTMVHGMTS ^SSWj 1980 

— PGLRNSLNRASFRKADL 2047 

GVDQTCTRVISKATPALPTVSISS (SAEV-h - 2180 
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FIG 15/1 



GCT GCC AAG AGA ATC ATG ATC ATC TGT AAA GCC CTC AAC TCA AAT CTT CAG GGA TGT GTT 120 9 
ala ala lys arg ile met ile ile cys lys ala leu asn ser asn leu gin gly cys val 

ACG GAG AAT GAA AAT GAT CCG ATA ACG AAT ATT GAG CCT TTT TTT GTG AGT GTG GCA CTT 1269 
thr glu asn glu asn asp pro ile thr asn ile glu pro phe phe val ser val ala leu 

TAT GAC CTC AGA GAC AGC AGG AAG ATT TCT GCT GAT TTT CAT GTG GAT CTA AAC CAT GCT 132 9 
tyr asp leu arg asp ser arg lys ile ser ala asp phe his val asp leu asn his ala 

GCT GTC AGA CAG ATG CTC TTG GGG GCT TCT GTG GCT TTG GAA AAT GGC AAC ATC GAC ACC 138 9 
ala val arg gin met leu leu gly ala ser val ala leu glu asn gly asn ile asp thr 

ATC ACT CCA AGA CAA TCA GAA GAA CCT CAC ATC AAG GGA CTT CCA GAG GAA TGG CTA AAA 144 9 
ile thr pro arg gin ser glu glu pro his ile lys gly leu pro glu glu trp leu lys 

TTT CCA AAG CAG GCT GTA TTT TCT GTA AGC AAT CCA CAT TCT GAA ATT GTT TTG GTG GCC 1509 
phe pro lys gin ala val phe ser val ser asn pro his ser glu ile val leu val ala 

AAA ATC GAA AAA GTC TTG ATG GGA AAC ATT GCA AGT GGT GCC GAA CCT TAT ATT AAG AAC 1569 
lys ile glu lys val leu met gly asn ile ala ser gly ala glu pro tyr ile lys asn 

CCA GAC TCC AAC AAG TAT GCA CAA AAG ATA CTA AAA TCC AAC AGA CAA TTC TGC AGC AAA 162 9 
pro asp ser asn lys tyr ala gin lys ile leu lys ser asn arg gin phe cys ser lys 

TTG GGA AAA TAC CGA AGG GCT TTT GCT TGG GCA GTA AGA TCA GTA TTT AAG GAC AAC CAG 168 9 
leu gly lys tyr arg arg ala phe ala trp ala val arg ser val phe lys asp asn gin 

GGA AAT GTG GAC AGA GAC TCA AGA TTT TCA CCA TTG TTT AGA CAA GAA AGT AGC AAG ATT 17 4 9 
gly asn val asp arg asp ser arg phe ser pro leu phe arg gin glu ser ser lys ile 

TCA ACT GAG GAC CTA GTT AAA CTA GTA .TCA GAT TAT AGA AGG GCC GAC AGA ATA AGC AAA 18 0 9 
ser thr glu asp leu val lys leu val ser asp tyr arg arg ala asp arg ile ser lys 

ATG CAG ACC ATT CCT GGA AGC CTG GAT ATT GCT GTT GAC AAC GTT CCT TTG GAG CAT CCA 18 69 
met gin thr ile pro gly ser leu asp ile ala val asp asn val pro leu glu his pro 

AAT TGT GTA ACA TCG TCC TTT ATC CCT GTC AAG CCT TTC AAC ATG ATG GCT CAA ACA GAA 192 9 
asn cys val thr ser ser phe ile pro val lys pro phe asn met met ala gin thr glu 

CCC ACA GTG GAG GTG GAA GAA TTT GTT TAC GAT TCA ACA AAG TAT TGT CGG CCT TAC AGA 198 9 
pro thr val glu val glu glu phe val tyr asp ser thr lys tyr cys arg pro tyr arg 

GTA TAT AAA AAT CAA ATT TAT ATT TAC CCC AAA CAC CTC AAG TAT GAT AGC CAG AAA TGC 2 04 9 
val tyr lys asn gin ile tyr ile tyr pro lys his leu lys tyr asp ser gin lys cys 

TTC AAC AAG GCA CGG AAT ATA ACT GTG TGC ATT GAA TTC AAA AAT TCA GAT GAA GAA AGT 210 9 
phe asn lys ala arg asn ile thr val cys ile glu phe lys asn ser asp glu glu ser 

GCC AAG CCC CTG AAG TGT ATT TAT GGA AAA CCT GAA GGG CCC CTC TTC ACC TCA GCC GCC 2169 
ala lys pro leu lys cys ile tyr gly lys pro glu gly pro leu phe thr ser ala ala 

TAC ACA GCA GTT CTG CAC CAC TCT CAG AAT CCG GAT TTC TCA GAT GAG GTG AAA ATT GAG 222 9 
tyr thr ala val leu his his ser gin asn pro asp phe ser asp glu val lys ile glu 

CTA CCA ACA CAA CTC CAT GAG AAA CAC CAT ATT TTG TTT TCT TTT TAT CAC GTC ACC TGT 228 9 
leu pro thr gin leu his glu lys his his ile leu phe ser phe tyr his val thr cys 

GAC ATC AAT GCA AAA GCT AAT GCC AAA AAG AAG GAG GCT CTG GAA ACG TCA GTT GGA TAT 234 9 
asp ile asn ala lys ala asn ala lys lys lys glu ala leu glu thr ser val gly tyr 

GCT TGG CTT CCT CTG ATG AAA CAC GAT CAG ATA GCT TCT CAA GAG TAC AAC ATC CCA ATA 24 09 
ala trp leu pro leu met lys his asp gin ile ala ser gin glu tyr asn ile pro ile 
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AAT ACA TCT AAT CAG GGG TCT AGA GAT GAT CTA AGC ACC AAT GGA GGA TTT CAA AGC CAG 372 9 
asn thr ser asn gin gly ser arg asp asp leu ser thr asn gly gly phe gin ser gin 

ACA GCT ATC AAA CAT GCA AAC TCT GTG GAT ACA TCA TTT TCT AAA GAT GTT TTA AAT TCC 37 8 9 
thr ala ile lys his ala asn ser val asp thr ser phe ser lys asp val leu asn ser 

ATA GCA GCA TTT TCA TCA ATA GCT ATT TCT ACA GTA AAC CAT GCT GAC TCC AGA GCA TCT 38 4 9 
ile ala ala phe ser ser ile ala ile ser thr val asn his ala asp ser arg ala ser 

TTA GCA AGT CTT GAC TCC AAT CCA AGT ACC AAT GAG AAG AGC AGT GAG AAG ACG GAC AAC 3 90 9 
leu ala ser leu asp ser asn pro ser thr asn glu lys ser ser glu lys thr asp asn 

TGT GAA AAG ATC CCA AGA CCC TTG GCT TTG ATT GGC TCA ACT CTT CGA TTT GAC AGG TTA 3 969 
cys glu lys ile pro arg pro leu ala leu ile gly ser thr leu arg phe asp arg leu 

GAT CAA GCA GAA ACC AGG AGT CTC CTG ATG TGT TTT CTT CAC ATT ATG AAA ACG ATT TCG 4 02 9 
asp gin ala glu thr arg ser leu leu met cys phe leu his ile met lys thr ile ser 

TAC GAG ACT CTG ATT GCC TAC TGG CAG AGA GCT CCC AGC CCA GAG GTG TCC GAC TTC TTC 4 08 9 
tyr glu thr leu ile ala tyr trp gin arg ala pro ser pro glu val ser asp phe phe 

AGC ATC TTG GAC GTT TGT CTT CAA AAT TTC AGA TAC CTA GGA AAA CGC AAC ATA ATA AGA 414 9 
ser ile leu asp val cys leu gin asn phe arg tyr leu gly lys arg asn ile ile arg 

AAA ATT GCT GCT GCA TTT AAA TTT GTG CAG TCC ACC CAG AAC AAT GGA ACT CTC AAA GGA 4209 
lys ile ala ala ala phe lys phe val gin ser thr gin asn asn gly thr leu lys gly 

TCC AAT CCT TCC TGC CAG ACA TCA GGG CTC TTG GCA CAA TGG ATG CAC TCC ACT TCC AGG 4269 
ser asn pro ser cys gin thr ser gly leu leu ala gin trp met his ser thr ser arg 

CAT GAA GGC CAT AAG CAG CAC AGA TCA CAA ACT TTA CCT ATA ATT CGA GGC AAA AAT GCA 4 32 9 
his glu gly his lys gin his arg ser gin thr leu pro ile ile arg gly lys asn ala 

CTT TCT AAC CCC AAA CTC TTA CAG ATG TTA GAC AAT ACC ATG ACC AGC AAC TCC AAT GAA 4 38 9 
leu ser asn pro lys leu leu gin met leu asp asn thr met thr ser asn ser asn glu 

ATA GAC ATC GTG CAT CAT GTA GAC ACT GAG GCC AAT ATA GCT ACG GAG GGT TGC CTC ACT 44 4 9 
ile asp ile val his his val asp thr glu ala asn ile ala thr glu gly cys leu thr 

ATT CTG GAC CTG ■ GTA TCC CTC TTC ACA CAG ACT CAT CAG AGA CAA CTC CAA CAA TGT GAC 4 50 9 
ile leu asp leu val ser leu phe thr gin thr his gin arg gin leu gin gin cys asp 

TGT- CAA AAT TCA TTG ATG AAA AGG GGC TTT GAT ACC TAC ATG CTC TTT TTC CAA GTC AAT 4 5 69 
cys gin asn ser leu met lys arg gly phe asp thr tyr met leu phe phe gin val asn 

CAG TCA GCC ACA GCG CTG AAG CAT GTG TTT GCC TCC TTG AGA CTG TTT GTA TGC AAG TTT 4 62 9 
gin ser ala thr ala leu lys his val phe ala ser leu arg leu phe val cys lys phe 

CCT TCA GCG TTC TTT CAA GGG CCT GCT GAC CTC TGT GGA TCA TTC TGT TAC GAA GTC CTA 4 68 9 
pro ser ala phe phe gin gly pro ala asp leu cys gly ser phe cys tyr glu val leu 

AAA TGC TGT AAC CAC AGG TCA CGG TCA ACT CAG ACA GAA GCC TCA GCC CTT CTG TAC TTG 474 9 
lys cys cys asn his arg ser arg ser thr gin thr glu ala ser ala leu leu tyr leu 

TTC ATG AGG AAG AAT TTT GAA TTT AAC AAG CAG AAG TCA ATT GTC CGG TCC CAC TTA CAA 4 809 
phe met arg lys asn phe glu phe asn lys gin lys ser ile val arg ser his leu gin 

CTC ATC AAA GCT GTG AGC CAG TTA ATA GCC GAT GCT GGG ATT GGA GGC TCT CGG TTT CAA 4 8 69 
leu ile lys ala val ser gin leu ile ala asp ala gly ile gly gly ser arg phe gin 

CAT TCG CTT GCA ATT ACC AAT AAT TTC GCC AAT GGA GAT AAG CAA ATG AAA AAC AGC AAT 4 92 9 
his ser leu ala ile thr asn asn phe ala asn gly asp lys gin met lys asn ser asn 



FIG 15/4 



TTC CCA GCA GAG GTG AAG GAC CTG ACT AAG CGT ATA AGG ACT GTT TTG ATG GCC ACA GCT 4 98 9 
phe pro ala glu val lys asp leu thr lys arg ile arg thr val leu met ala thr ala 



CAG ATG AAG GAG CAC GAG AAG GAC CCC GAG ATG CTG GTG GAT CTC CAG TAC AGC CTG GCA 504 9 
gin met lys glu his glu lys asp pro glu met leu val asp leu gin tyr ser leu ala 

AAC TCC TAC GCA AGC ACT CCT GAA CTA CGC AGG ACC TGG CTG GAA AGT ATG GCC AAG ATT 510 9 
asn ser tyr ala ser thr pro glu leu arg arg thr trp leu glu ser met ala lys ile 

CAT GCC AGA AAC GGA GAT TTA TCT GAG GCT GCC ATG TGT TAC ATC CAT ATT GCT GCT CTC 5169 
his ala arg asn gly asp leu ser glu ala ala met cys tyr ile his ile ala ala leu 

ATT GCA GAG TAT CTG AAA AGA AAG GGT TAC TGG AAA GTG GAA AAG ATT TGC ACA GCA TCC 522 9 
ile ala glu tyr leu lys arg lys gly tyr trp lys val glu lys ile cys thr ala ser 

CTG CTC TCG GAG GAT ACC CAC CCC TGT GAT AGC AAC TCA TTA CTA ACA ACT CCC AGT GGA 52 8 9 
leu leu ser glu asp thr his pro cys asp ser asn ser leu leu thr thr pro ser gly 

GGA AGC ATG TTC TCT ATG GGA TGG CCA GCT TTT TTG AGC ATT ACA CCC AAC ATT AAG GAA 534 9. 
gly ser met phe ser met gly trp pro ala phe leu ser ile thr pro asn ile lys glu 

GAA GGA GCC GCG AAA GAG GAT TCT GGA ATG CAC GAT ACA CCC TAC AAT GAG AAT ATC CTG 54 0 9 
glu gly ala ala lys glu asp ser gly met his asp thr pro tyr asn glu asn ile leu 

GTG GAG CAG CTA TAC ATG TGT GGG GAG TTT CTC TGG AAG TCT GAG CGA TAT GAA CTC ATT 54 69 
val glu gin leu tyr met cys gly glu phe leu trp lys ser glu arg tyr glu leu ile 

GCT GAT GTC AAC AAG CCC ATC ATT GCT GTC TTT GAG AAA CAA CGA GAC TTC AAA AAA TTG 552 9 
ala asp val asn lys pro ile ile ala val phe glu lys gin arg asp phe lys lys leu 

TCA GAT CTC TAC TAC GAC ATT CAT CGG TCA TAT CTG AAA GTG GCA GAG GTG GTG AAT TCG 558 9 
ser asp leu tyr tyr asp ile his arg ser tyr leu lys val ala glu val val asn ser 

GAG AAG CGG CTG TTT GGT CGC TAC TAT CGT GTG GCA TTT TAT GGG CAG GGC TTT TTT GAA 564 9 
glu lys arg leu phe gly arg tyr tyr arg val ala phe tyr gly gin gly phe phe glu 

GAA GAA GAA GGT AAA GAG TAT ATT TAT. AAA GAG CCT AAG CTG ACA GGT CTG TCC GAG ATT 57 0 9 
glu glu glu gly lys glu tyr ile tyr lys glu pro lys leu thr gly leu ser glu ile 

TCC CAA AGA TTA CTC AAG CTC TAT GCA GAT AAA TTT GGA GCA GAC AAT GTG AAG ATA ATC 57 69 
ser gin arg leu leu lys leu tyr ala asp lys phe" gly ala asp asn val lys ile ile 

CAG GAT TCC AAC AAG GTA AAC CCC AAG GAT TTG GAC CCC AAA TAT GCC TAC ATC CAG GTG 582 9 
gin asp ser asn lys val asn pro lys asp leu asp pro lys tyr ala tyr ile gin val 

ACC TAT GTG ACG CCG TTC TTT GAG GAA AAG GAA ATC GAA GAC CGG AAG ACA GAT TTC GAA 588 9 
thr tyr val thr pro phe phe glu glu lys glu ile glu asp arg lys thr asp phe glu 

ATG CAC CAC AAC ATC AAC CGC TTT GTC TTC GAG ACA CCC TTC ACG CTG TCG GGC AAG AAG 594 9 
met his his asn ile asn arg phe val phe glu thr pro phe thr leu ser gly lys lys 

CAC GGT GGG GTG GCG GAG CAG TGC AAG CGG CGG ACG ATC CTG ACA ACG AGT CAC CTG TTC 6009 
his gly gly val ala glu gin cys lys arg arg thr ile leu thr thr ser his leu phe 

CCC TAC GTG AAG AAG AGG ATC CAG GTC ATC AGC CAA TCA AGC ACA GAG CTG AAT CCT ATT 6069 
pro tyr val lys lys arg ile gin val ile ser gin ser ser thr glu leu asn pro ile 

GAA GTG GCA ATT GAC GAG ATG TCC AGG AAG GTC TCT GAG CTT AAT CAG CTT TGC ACA ATG 612 9 
glu val ala ile asp glu met ser arg lys val ser glu leu asn gin leu cys thr met 

GAA GAA GTG GAC ATG ATC AGC CTA CAG CTC AAA CTG CAA GGA AGT GTC AGC GTG AAG GTT 618 9 
glu glu val asp met ile ser leu gin leu lys leu gin gly ser val ser val lys val 
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AAT GCT GGG CCA ATG GCC TAT GCA CGA GCT TTT CTT GAA GAA ACC AAT GCA AAG AAG TAC 62 4 9 
asn ala gly pro met ala tyr ala arg ala phe leu glu glu thr asn ala lys lys tyr 
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A. Mouse CLASP -1 cDNA «e<ru«nc« 

-169/1 -139/11 

G GAC ACT GAC ATG GAG TGA AGG AGT AGA AAG CTA CAT AGA TGT AGC ATG AGC ACT TCC 

-109/21 -79/31 

CTT CTT AGG GGA GTG GAG AGA TTT GTT CAG CCA GCT CAG TGA AGC CTC AGA AAG AAA AGG 
-49/41 -19/51 

TTT AAG GAC CTC AGT AGT TTT CAA ATA TTT TAT GAC TCC ATT TCA AGG CAT ATG ATG AGC 

met met ser 

12/61 42/71 

TTC CGA GGA AAG GAA TTC TGG AAG AGG AGG CGA ACA GTG AAG AGA GTG AAC CCA GAA GGA 
phe arg gly lys glu phe trp lys arg arg arg thr val lys arg val asn pro glu gly 
72/81 102/91 

ATC CAC AAG GCT GGT GCT CAG GAA AAG CCC AGA CTC TTG GAT CCT TTG GAT TAT GAA ACC 
ile his lys ala gly ala gin glu lys pro arg leu leu asp pro leu asp tyr glu thr 
132/101 162/111 

GTC ATT GAA GAA CTT GAA AAG ACC TAT CGG GAT GAT CCT CTT CAG GAC CTT CTG TTC TTC 
val ile glu glu leu glu lys thr tyr arg asp asp pro leu gin asp leu leu phe phe 
192/121 222/131 

CCC AGT GAT GAC TTC TCA ACA GCC ACG GTT TCT TGG GAT ATC CGA ACA CTC TAT TCG ACG 
pro ser asp asp phe ser thr ala thr val ser trp asp ile arg thr leu tyr ser thr 
252/141 282/151 

GTA CCA GAA GAG GCA GAG CAC AGG GCA GAA AGT CTG CTG GTG AAA GAG GCT TGT AAA TTC 
val pro glu glu ala glu his arg ala glu ser leu leu val lys glu ala cys lys phe 
312/161 342/171 

TAC AGT TCC CAG TGG TAC GTG GTA AAC TAC AAA TAT GAA CAA TAT TCT GGA GAC ATT CGA 
tyr ser ser gin trp tyr val val asn tyr lys tyr glu gin tyr ser gly asp ile arg 
372/181 402/191 

CAG TTG CCC CGA GCA GAA CAC AAG CCA GAG AAA CTT CCT TCA CAT TCC TTT GAG GTT GAC 
gin leu pro arg ala glu his lys pro glu lys leu pro ser his ser phe glu val asp 
432/201 462/211 

CAT GAA GAT GCT GAT AAG GAT GAA GAT ACC ACA TCC CAC TCA TCA TCC AAG GGC GGT GGG 
his glu asp ala asp lys asp glu asp thr thr ser his ser ser ser lys gly gly gly 
492/221 522/231 

GGA GCT GGA GGT ACT GGT GTG TTC AAG TCT GGC TGG CTT TAC AAG GGG AAT TTT AAC AGC 
gly ala gly gly thr gly val phe lys ser gly trp leu tyr lys gly asn phe asn ser 
552/241 * 582/251 

ACT GTG AAC AAT ACT GTC ACT GTT CGG TCA TTC AAA AAG CGC TAC TTC CAG CTu ACT CAG 
thr val asn asn thr val thr val arg ser phe lys lys arg tyr phe gin leu thr gin 
612/261 642/271 

TTG CCT GAT AAC TCC TAC ATT ATG AAC TTT TAC AAA GAT GAA AAG ATA TCC AAA GAG CCC 
leu pro asp asn ser tyr ile met asn phe tyr lys asp glu lys ile ser lys glu pro 
672/281 702/291 

AAG GGC TGC ATC TTT TTG GAT TCC TGC ACA GGT GTA GTA CAG AAT AAC AGG CTG AGG AAA 
lys gly cys ile phe leu asp ser cys thr gly val val gin asn asn arg leu arg lys 
732/301 762/311 

TAT GCC TTC GAG TTG AAG ATG AAC GAC CTG ACC TAT TTT GTG CTG GCT GCT GAA ACA GAG 
tyr ala phe glu leu lys met asn asp leu thr tyr phe val leu ala ala glu thr glu 
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ATT TTT GAA 


AAT 
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TTG 


ATG 
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1152/441 
















1182/451 
















ATC ATC TGT 
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ATT 
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GTG 


AGC GTG GCA 


CTT 


TAC 


GAC 


CTC 


AGA 


GAT 


AAC 


pro val thr 




ile 


glu 






phe 


val 


ser val ala 


leu 


tyr 


asp 


leu 
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asp 
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1272/481 
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AGG AAG ATC 


TCT 
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CAT 
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pro 
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val 


arg 


gin 


met 


leu 


1332/501 
















1362/511 
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CCG 


GCC 


TTG 
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leu 
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gly 
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pro 


arg 


gin 
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1392/521 
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GAA GAA CCT 


CAC 


ATA 
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pro 
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pro 


lys 


gin 


ala 


val 


1452/541 
















1482/551 
















TTC TCT GTG 


AGT 


GAT 
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TCT 


GAA 


ATT 


GTT TTG GTG 
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AAA 


GTC 


TTA 


phe ser val 


ser 


asp 


pro 
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ser 
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ile 


val leu val 


ala 


lys 


val 


glu 


lys 


val 


leu 


1512/561 
















1542/571 
















ATG GGA AAC 


ATC 


GGG 


AGT 


GGT 


GCT 


GAA 


CCT 


TAT ATC AAG 


AAT 
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TCC 
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TTT 
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ile 
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ser 
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glu 


pro 


tyr ile lys 




pro 


asp 


ser 
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lys 




1572/581 
















1602/591 
















GCA CAA AAG 


ATT 


CTA 


AAA 


TCC 


AAT 


AGG 


CAG 


TTC TGC AGC 


AAA 


TTG 


GGG 


AAA 


TAT 


CGG 


ATG 


ala gin lys 
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leu 
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arg 


gin 


phe cys ser 


lys 


leu 


gly 


lys 


tyr 




met 


1632/601 
















1662/611 
















CCG TTT GCT 


TGG 


GCA 


GTG 


AGA 


TCT 


GTA 


TTT 


AAG GAC AAC 
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GGG 


AAC 
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AGG 


GAC 
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trp 
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arg 


ser 


val 


phe 




gin 


gly 
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val 


asp 


arg 


asp 


1692/621 
















1722/631 
















TCA CGG TTT 


TCA 


CCA 


CTG 


TAT 


AGA 


CAA 


GAA 


AGC AGC AAG 


ATG 


TCG 


TCT 
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CTA 


ser arg phe 


ser 


pro 


leu 


tyr 


arg 


gin 




ser ser lys 




ser 


ser 


glu 


asp 


leu 


leu 


1752/641 
















1782/651 
















AAA CTG GTG 


TCA 


GAC 


TAC 


AGA 


AGA 


GCT 


GAC 


CGG ATA AGC 


AAA 


ATG 


CAG 


AGC 


ATC 


CCT 


GGA 


lys leu val 


ser 


asp 


tyr 


arg 


arg 


ala 


asp 


arg ile ser 


lys 


met 




ser 


ile 


pro gly 


1812/661 
















1842/671 
















AGC CTG GAT 


ATT 


GCT 


GTG 


GAC 


AAC 


ATT 


CCC 


TTG GAG CAC 


CCA 


AAC 


TGT 


GTA 


ACA 


TCG 


TCC 


ser leu asp 


ile 


ala 


val 


asp 




ile 


pro 


leu glu his 


pro 
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cys 


val 
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ser 


ser 
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pro 
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asp 
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ATT TAT GGG 
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CCT 
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phe 
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AAT 
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GAT 
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asp 
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2232/801 
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CAC 


CTT 
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TCC 


TTT 
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ATA 
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ATT 
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2472/881 
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CAT 
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CCT 
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TAC 
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3702/1291 
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GAC 


CTC 
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ACT 


AAT 
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ala 
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GAT 
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TTT 
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ile 


4812/1661 












CAG TTA ATA 


GCT 


GAT 


GCG 


GGG 


ATC 


gin leu lie 


ala 


asp 


ala 


gly 


ile 


4872/1681 










AAC AAC TTT 


GCC 


AAT 


GGA 


GAT 


AAA 


asn asn phe 


ala 


asn 


gly 


asp 


lys 


4932/1701 












GAT CTG ACT 


AAA 


CGC 


ATA 


AGG 


ACT 


asp leu thr 


lys 


arg 


ile 


arg 


thr 


4992/1721 












AAG GAC CCA 


GAG 


ATG 


CTG 


GTG 


GAC 


lys asp pro 


glu 


met 


leu 


val 


asp 


5052/1741 












CCG GAG TTA 


CGG 


AGG 


ACC 


TGG 


CTG 


pro glu leu 


arg 


arg 


thr 


trp 




5112/1761 












CTG TCT GAG 


GCT 


GCG 


ATG 


TGT 


TAC 


leu ser glu 


ala 






cys 


tyr 


5172/1781 












CGC AAG GGT 


TAC 


TGG 


AAA 


ATG 


GAA 


arg lys gly 


tyr 


trp 


lys 


met 


glu 


5232/1801 










CAA CCC TGT 


GAT 


AGC 


AAC 


TTA 


TTA 


gin pro cys 


asp 


ser 


asn 


leu 


leu 







4182/1451 
















AAT 


GGG 


ACT CTG AAG 


GGA 


TCC 


AAT 


CCT 


TCC 


TGC 


CAG 


asn 


gly 


thr leu lys 


gly 


ser 


asn 


pro 




cys 


gin 






4242/1471 










ATG 


CAC 


ACA ACT TCT 


GGC 


CAC 


GAG 


GGA 


CAT 


AAG 


CAG 




his 


thr thr ser 
4302/1491 


gly 


his 


g u 


<3 y 


his 


ys 


g n 


ATC 


CGA 


GGC AAA AAT 


GCA 


CTT 


TCC 


AAC 


CCC 


AAA 


CTT 


ile 




gly lys asn 


ala 


leu 


ser 


asn 


pro 
P 


lys 


leu 






4362/1511 












AAC 


AGC 


AAT TCC AAT 


GAA 


ATA 


GAC 


ATT 


GTC 


CAC 


CAT 






4422/1531 


glu 


ile 


asp 


ile 


val 


his 


his 


ACC 


GAG 


GTC TGC CTC 


ACT 


ATT 


CTG 


GAC 


CTG 


CTG 


TCT 


thr 


glu 


val cys leu 


thr 


lie 


leu 


asp 


leu 


leu 


ser 






4482/1551 














CAG 


CTC 


CAA CAA TCC 


GAC 


TGT 


CAA 


AAT 


TCA 


CTC 


ATG 


gin 


leu 






cy 


oln 


asn 


ser 


eu 


me 






4542/1571 












CTG 


TTT 


TTC CAA GTC 


AAC 


CAG 


TCA 


GCC 


TCA 


GCC 


CTG 


leu 


phe 


phe gin val 


asn 


gin 


ser 


ala 


ser 


ala 


leu 






4602/1591 














CTG 


TTT 


GTG TGC AAG 


TTT 


CCG 


TCA 


GCG 


TTT 


TTC 


CAA 


leu 


phe 


val cys lys 


phe 


pro 


ser 


ala 


phe 


phe 


gin 






4662/1611 












TTC 


TGC 


TAT GAA GTC 


CTC 


AAA 


TGC 


TGT 


AAC 


CAC 


AGG 


phe 


cys 


tyr glu val 


leu 


lys 


cys 


cys 


asn 


his 


arg 






4722/1631 














TCA 


GCA 


CTT CTA TAC 


TTC 


TTC 


ATG 


AGG 


AAG 


AAC 


TTT 




ala 


leu leu tyr 
4782/1651 


phe 


h 
P e 


met 


arg 


ys 


asn 


P e 


GTC 


CGG 


TCT CAC TTA 


CAA 


CTC 


ATC 


AAA 


GCA 


GTG 


AGC 


val 


arg 


ser his leu 


gin 


leu 


ile 


lys 


ala 


val 


ser 






4842/1671 














GGA 


GGG 


TCT CGC TTT 


CAA 


CAC 


TCC 


CTT 


GCA 


ATC 


ACG 






ser arg phe 
4902/1691 


gin 


his 


ser 


leu 


ala 


ile 


thr 


CAG 


ATG 


AAA AAC AGC 


AAT 


TTC 


CCA 


GCA 


GAG 


GTG 


AAA 


gin 


met 


lys asn ser 
4962/1711 




phe 


pro 


ala 


glu 


val 


lys 


GTT 


TTG 


ATG GCC ACA 


GCC 


CAG 


ATG 


AAG 


GAG 


CAT 


GAG 


val 


leu 


met ala thr 
5022/1731 


ala 


gin 


met 


lys 


glu 


his 


glu 


CTT 


CAA 


TAC AGC CTA 


GCA 


AAC 


TCC 


TAT 


GCA 


AGT 


ACC 


leu 


gin 


tyr ser leu 


ala 






tyr 


ala 




thr 






5082/1751 














GAA 


AGC 


ATG GCC AAG 


ATT 


CAT 


GCA 


AGA 


AAT 


GGA 


GAC 


glu 


ser 


met ala lys 
5142/1771 


ile 


his 


ala 


arg 




giy 


asp 


ATC 


CAT 


ATA GCT GCA 


CTT 


ATT 


GCA 


GAA 


TAC 


CTG 


AAG 


ile 


his 


ile ala ala 


leu 


ile 


ala 


glu 


tyr 


leu 


lys 






5202/1791 










AAG 


ATT 


TGC ACA CCA 


CCC 


CTG 


CTT 


CCA 


GAA 


GAC 


ACC 


lys 


ile 


cys thr pro 


pro 


leu 


leu 


pro 




asp 


thr 






5262/1811 










CTA 


ACA 


ACT CCA GGC 


GGA 


GGA 


AGC 


ATG 


TTC 


TCT 


ATG 


leu 


thr 


thr pro gly 


gly 


gly 


ser 


met 


phe 


ser 





It 
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5292/1821 
















5322/1831 










GGA TGG CCA 


GCC 


TTT 


CTG 


AGC 


ATC 


ACC 


CCA 


AAC ATT AAA 


GAA 


GAA 


GGA GCA ATG 


AAA GAG 


gly trp pro 


ala 


phe 


leu 


ser 


ile 


thr 


pro 


asn ile lys 


qlu 


qlu 


gly ala met 


lys glu 


5352/1841 
















5382/1851 








GAT TCT GGA 


ATG 


CAA 


GAC 


ACC 


CCG 


TAC 


AAT 


GAG AAC ATC 


CTG 


GTG 


GAA CAG CTG 


TAT ATG 


asp ser gly 


met 




asp 


thr 


pro 


tyr 


asn 




leu 


val 


glu gin leu tyr met 


5412/1861 
















5442/1871 










TGT GTG GAG 


TTC 


CTT 


TGG 


AAG 


TCT 


GAA 


CGA 


TAC GAA CTC 


ATC 


GCT 


GAT GTC AAT 


AAG CCC 


cys val glu 




leu 


trp 


lys 




glu 


a 




ile 


ala 




lys pro 


5472/1881 
















5502/1891 








ATC ATC GCT 


GTC 


TTT 


GAA 


AAG 


CAA 


CGA 


GAC 


TTC AAA AAA 


TTA 


TCA 


GAT CTC TAT 


TAT GAC 


lie lie ala 


val 




glu 


lys 


gin 


arq 


asp 




leu 


ser 


asp leu tyr 




5532/1901 
















5562/1911 








ATC CAC CGG 


TCC 


TAT 


CTG 


AAA 


GTG 


GCA 


GAG 






GAG 


AAG CGA TTG 


TTT GGT 




ser 


tyr 


leu 


lys 


val 


ala 


g u 


val val asn 


e 


g u 


lys arg leu 


phe gly 


5592/1921 
















5622/1931 








CGT TAC TAT 


AGA 


GTG 


GCG 


TTT 


TAT 


GGG 


CAG 


GGA TTC TTT 


GAG 


GAG 


GAG GAA GGT 


AAA GAG 




arg 


val 


ala 


phe 


tyr 


- 


- 

g n 




g u 


g u 


glu glu gly 


lys glu 


5652/1941* 
















5682/1951 








TAT ATC TAC 


AAA 


GAG 


CCT 


AAG 


CTG 


ACA 


GGG 


CTC TCG GAG 


ATC 


TCC 


CAA AGG CTT 


CTC AAG 




lys 


glu 


pro 


lys 


leu 


thr 


g y 




6 


ser 


gin arg leu 


leu lys 


5712/1961* 
















5742/1971 






CTC TAT GCA 


GAC 


AAA 


TTT 


GGA 


GCA 


GAC 


AAT 




ATT 


CAA 


GAC TCC AAC 


AAG GTA 




asp 


lys 


Phe 


- 

Y 


ala 


asp 


asn 


val lys ile 


G 


gin 


asp ser asn 


lys val 


5772/1981 
















5802/1991 








AAC CCC AAG 


GAT 


CTG 


GAC 


CCC 


AAA 


TAT 


GCC 


TAT ATT CAG 


GTG 


ACC 


TAT GTC ACA 


CCA TTC 


asn pro lys 


asp 


leu 


asp 


pro 


lys 


tyr 


ala 




val 


thr 


tyr val thr pro phe 


5832/2001 
















5862/2011 










TTT GAA GAA 


AAG 


GAA 


ATC 


GAG 


GAC 


CGA 


AAG 


ACA GAC TTT 


GAA 


ATG 


CAT CAC AAC 


ATC AAT 


phe glu glu 


lys 


glu 


ile 


glu 


asp 


arq 


1 s 
ys 




lu 
g u 


met 


his his asn 


ile asn 


5892/2021 
















5922/2031 










CGC TTT GTC 


TTT 


GAG 


ACA 


CCC 


TTC 


ACT 


CTG 


TCA GGC AAG 


AAG 


CAC 


GGA GGA GTG 


GCT GAG 


arg phe val 


phe 


glu 


thr 


pro 


phe 


thr 


leu 




lys 


his 


gly gly val ala glu 


5952/2041 
















5982/2051 










CAG TGC AAG 


CGG 


AGG 


ACA 


GTC 


CTG 


ACC 


ACA 


AGC CAC TTG 


TTC 


CCC 


TAC GTA AAG 


AAG AGG 


gin cys lys 


arg 




thr 


val 


leu 


thr 


thr 




he 

P 








6012/2061 
















6042/2071 










ATC CAG GTC 


ATC 


AGC 


CAA 


TCA 


AGC 


ACA 


GAG 


CTG AAT CCT 


ATC 


GAG 


GTG GCA ATT 


GAT GAG 


ile gin val 


ile 




gin 






thr 


glu 


leu asn pro 


ile 


qlu 


val ala ile asp glu 


6072/2081 
















6102/2091 








ATG TCC AGG 


AAG 


GTC 


TCT 


GAG 


CTT 


AAT 


CAG 


CTG TGC ACC 


ACA 


GAG 


GAG GTG GAT 


ATG ATC 


met ser arg 


lys 


val 




glu 


leu 


asn 


gin 


leu cys thr 


thr 


glu 


glu val asp 


met ile 


6132/2101 
















6162/2111 










CGC CTA CAG 


CTC 


AAA 


CTC 


CAG 


GGC 


AGT 


GTC 


AGC GTG AAG 


GTC 


AAT 


GCT GGG CCA 


ATG GCT 


arg leu gin 


leu 


lys 


leu 




gly 




val 




val 




ala gly pro 


met ala 


6192/2121 
















6222/2131* 










TAT GCT CGA 


GCC 


TTT 


CTT 


GAA 


GAA 


ACT 


AAT 


GCA AAG AAG 


TAT 


CCT 


GAC AAC CAA 


GTT AAG 


tyr ala arg 


ala 


phe 


leu 




glu 


thr 




ala lys lys 


tyr 






val lys 


6252/2141 
















6282/2151 








CTA CTA AAG 


GAA 


ATA 


TTC 


AGG 


CAA 


TTT 


GCA 


GAT GCG TGT 


GGG 


CAG 


GCT CTT GAT 


GTG AAT 


leu leu lys 




ile 


phe 


arg 


gin 




ala 


asp ala cys 


gly 


gin 


ala leu asp 


val asn 


6312/2161 
















6342/2171 










GAG CGT CTC 


ATC 


AAG 


GAA 


GAC 


CAG 


CTG 


GAG 


TAC CAG GAA 


GAA 


CTG 


AGG TCC CAT 


TAT AAG 


glu arg leu 


ile 


lys 




asp 


gin 


leu 


glu 


tyr gin glu 


glu 


leu 


arg ser his 


tyr lys 


6372/2181 
















6402/2191 










GAC ATG CTC 


AGT 


GAA 


CTG 


TCT 


GCC 


ATC 


ATG 


AAT GAG CAG 


ATT 


ACG 


GGC AGG GAC 


GAC CCA 


asp met leu 




glu 


leu 


ser 


ala 


ile 


met 


asn glu gin 


ile 


thr 


gly arg asp asp pro 
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6432/2201 6462/2211 

GCA AAG TGC GGA GTG GAG CGA CCC TAC ACC ACA CGT GTA ACT AGC AAG GGG ACC GCG GCT 
ala lys cys gly val glu arg pro tyr thr thr arg val thr ser lys gly thr ala ala 
6492/2221 6522/2231 

GTA CCT GTG GTC TCC ATC TCA TCC AGT GCG GAG GTT TGA GAG GAA CCC TGG AGC ATC CGA 
val pro val val ser ile ser ser ser ala glu val OPA 
6552/2241 6582/2251 

TGC ACC TCT CAG AGA ACT CTC TAA ATG TTT TGC AGC TAA TCT CGG GGA AGA AAA AGA TAG 



6852/2341 6882/23S1 

TCA AAG TGA ATG TTT TTA TAA CAT TTA AGC ATA TTT CCA ATG TAA ATA GAA GAT TGT AAA 



B. Moub« CLASP -2 cDNA segu« 
1/1 

CCA CTC GAC TAT GAA AAT GTC 
pro leu asp tyr glu asn val 
61/21 

CGG GAG ATG CTC CTC TTC CCT 
arg glu met leu leu phe pro 
121/41 

CGG TAC TTA CGC TCC ACG GTC 
arg tyr leu arg ser thr val 
181/61 

ACT GAG TGC ATC AAA ACC TAC 
thr glu cys ile lys thr tyr 



31/11 

ATC GTG CAG AAG AAG ACG 
lie val gin lys lys thr 
91/31 

TAT GAT GAC TTC CAG ACG 
tyr asp asp phe gin thr 

151/51 
CCT GCA AAT GCA GAG GAG 
pro ala asn ala glu glu 

211/71 
AAC TCT GAC TGG CAT CTT 
asn ser asp trp his leu 



CAG ATC CTA AAC GAC TGC CTG 
gin ile leu asn asp cys leu 

GCC ATC CTG AGG CGG CAG GGG 
ala ile leu arg arg gin gly 

GAA GCA CAG AGC CTG TTT GTC 
glu ala gin ser leu phe val 

GTG ACC TAT AAA TAT GAA GAT 
val thr tyr lys tyr glu asp 
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241/81 


















271/91 


















TAC TCA 


GGA 


GAG 


TTC 


CGA 


CAG 


CTT 


CCA 


AAC 


AAA GTG 


CCC 


AAG 


CTG 


GAT 


AAA 


CTT 


CCG 


GTC 


tyr ser 


gly 


glu 


phe 


arg 


gin 


leu 


pro 


asn 


lys val 


pro 


lys 


leu 


asp 


lys 


leu 


pro 


val 


301/101 


















331/111 
















CAC GTC 


TAT 


GAA 


GTC 


GAT 


GAG 


GAG 


GCC 


GAC 


AAA GAT 


GAG 


GAT 


GCT 


GCT 


TCC 


CTT 


GGG 


TCT 


his val 


tyr 


glu 


val 


asp 


glu 


glu 


ala 


asp 


lys asp 


glu 


asp 


ala 


ala 


ser 


leu 


gly 


ser 


361/121 


















391/131 
















CAG AAG 


GGC 


GGG 


ATC 


ACC 


AAG 


CAT 


GGC 


TGG 


CTG TAC 


AAA 


GGC 


AAC 


ATG 


AAC 


AGT 


GCC 


ATC 


gin lys 


gly 


gly 


ile 


thr 


lys 


his 


gly 


trp 


leu tyr 


lys 


gly 










ala 


ile 


421/141 


















451/151 


















AGC GTG 


ACG 


ATG 


AGG 


TCA 


TTC 


AAG 


AGG 


CGG 


TTT TTC 


CAC 


CTG 


ATT 


CAA 


CTT 


GGC 


GAT 


GGA 


ser val 


thr 


met 


arg 


ser 


phe 


lys 


arg 


arg 


phe phe 


his 


leu 


ile 


gin 


leu 


gly 


asp 




481/161 


















511/171 














TCC TAT 


AAT 


CTA 


AAC 


TTT 


TAT 


AAA 


GAT 


GAG 


AAG ATC 


TCC 


AAG 


GAA 


CCG 


AAA 


GGG 


TCC 


ATA 


ser tyr 


asn 


leu 


asn 


phe 


tyr 


lys 


asp 


glu 


lys ile 




lys 


glu 


pro 


lys 


gly 


ser 


ile 


541/181 


















571/191 


















TTC CTG 


GAT 


TCC 


TGC 


ATG 


GGT 


GTG 


ATC 


CAG 


AAC AAC 


AGA 


GTC 


AGG 


CGC 


TTC 


GCT 


TTT 


GAG 


phe leu 


asp 


ser 


cys 


met 


gly 


val 


ile 


gin 


asn asn 




val 




arg 


phe 


ala 




glu 


601/201 


















631/211 


















CTC AAG 


ATG 


CAA 


GAC 


AAA 


AGC 


AGT 


TAC 


CTT 


CTG GCG 


GCA 


GAC 


AGC 


GAG 


GCA 


GAG 


ATG 


GAG 


leu lys 


met 


gin 


asp 


lys 


ser 


ser 


tyr 


leu 


leu ala 


ala 








ala 


glu 


met 




661/221 


















691/231 












GAG TGG 


GTC 


ACT 


GTT 


CTC 


AAC 


AAG 


ATC 


CTC 


CAG CTC 


AAC 


TTT 


GAA 


GCC 


GCA 


ATG 


CAA 


GAA 


glu trp 


val 


thr 


val 


leu 


asn 


lys 


ile 


leu 


gin leu 




phe 


glu 


ala 


ala 


met 


gin 


glu 


721/241 


















751/251 














AAG CGA 


AAC 


GGG 


GAC 


CCT 


CAT 


GAA 


GAT 


GAC 


GAG CAG 


AGC 


AAA 


CTG 


GAA 


GGT 


TCT 


GGT 


TCC 


lys arg 




gly 


asp 


pro 


his 


glu 


asp 




glu gin 


ser 


lys 


leu 


glu 


gly 


ser 


gly 




781/261 


















811/271 


















GGT TTG 


GAC 


AGC 


TAC 


CTG 


CCT 


GAA 


CTT 


GCC 


AAG AGC 


ACC 


AGA 


GAA 


GCA 


GAG 


ATC 


AAA 


TTG 


gly leu 


asp 


ser 


tyr 


leu 


pro 


glu 


leu 


ala 


lys ser 


thr 




glu 


ala 


glu 


ile 


lys 


leu 


841/281 


















871/291 














AAA AGC 


GAG 


AGC 


AGA 


GTG 


AAG 


CTT 


TTT 


TAC 


TTG GAC 


CCA 


GAT 


ACC 


CAG 


AAA 


CTT 


GAC 


TTC 


lys ser 


glu 


ser 


arg 


val 


lys 


leu 


phe 


tyr 


leu asp 


pro 


asp 


thr 


gin 


lys 


leu 


asp 


phe 


901/301 


















931/311 


















TCA TCC 


GCT 


GAA 


CCA 


GAA 


GTG 


AAG 


CCC 


TTT 


GAA GAA 


AAG 


TTT 


GGG 


AAA 


AGG 


ATT 


CTC 


GTC 


ser ser 


ala 


glu 


pro 


glu 


val 


lys 


pro 


phe 


glu glu 


lys 


phe 


gly 


lys 


arg 


ile 


leu 


val 


961/321 


















991/331 


















AAG TGC 


AAC 


GAT 


TTG 


TCC 


TTT 


AAC 


CTG 


CAG 


TGC TGT 


GTC 


GCA 


GAG 


AAT 


GAG 


GAA 


GGA 


CCC 


lys cys 




asp 


leu 


ser 


phe 


asn 


leu 


gin 


cys cys 


val 


ala 


glu 


asn 


glu 


glu 


gly 


pro 


1021/341 
















1051/351 














ACG ACA 


AAT 


GTC 


GAG 


CCT 


TTC 


TTT 


GTC 


ACC 


CTG TCC 


CTG 


TTC 


GAC 


ATA 


AAA 


TAC 


AAC 


CGA 


thr thr 




val 


glu 


pro 


phe 


phe 


val 


thr 


leu ser 


leu 


phe 


asp 


ile 


lys 


tyr 


asn 


arg 


1081/361 
















1111/371 
















AAG ATT 


TCT 


GCT 


GAC 


TTC 


CAC 


GTG 


GAC 


CTG 


AAC CAC 


TTC 


TCA 


GTG 


CGG 


CAG 


ATG 


CTG 


GCC 


lys lie 




ala 


asp 




his 


val 




leu 


asn his 


L 




val 


arg 


gin 




leu 


ala 


1141/381 
















1171/39: 














CCC ACG 


TCC 


CCA 


GCT 


CTG 


ATG 


AAC 


GGT 


GGC 


CAG AGC 


CCA 




GCC 




CAG 


GAT 


GCC 


CTT 


pro thr 


ser 


pro 


ala 


leu 


met 




gly 


gly 


gin ser 




pro 


ala 


phe 


gin 


asp 


ala 


leu 


1201/401 
















1231/411 
















CAT ACA 


GCC 


ATG 


CAG 


TAT 


CCG 


AAG 


CAG 


GGA 


ATA TTT 


TCA 


GTC 


ACG 


TGT 


CCT 


CAC 


CCA 


GAC 


his thr 






gin 


tyr 


pro 


lys 




gly 


ile phe 


ser 


val 


thr 


cys 


pro 


his 


pro 


asp 


1261/421 
















1291/431 
















ATA TTT 


CTT 


GTG 


GCC 


AGA 


ATT 


GAG 


AAG 


GTC 


CTC CAA 


GGA 


AGC 


ATC 


ACG 


CAC 


TGT 


GCT 


GAA 


ile phe 


leu 


val 


ala 


arg 


ile 


glu 


lys 


val 


leu gin gly 


ser 


ile 


thr 


his 


cys 


ala 


glu 


1321/441 
















1351/451 












CCG TAT 


ATG 


AGA 


AGC 


TCA 


GAC 


TCT 


TCT 


AAG 


GTT GCC 


CAG 


AAG 


GTG 


CTG 


AAG 


AAT 


GCC 


AAG 


pro tyr met 


arg 


ser 


ser 


asp 


ser 


ser 


lys 


val ala gin 


lys 


val 


leu 


lys 


asn 


ala 


lys 
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1381/461 1411/471 

CAG GCA TGC CAA AGA CTA GGA CAG TAC AGA ATG CCA TTC GCC TGG GCA GCA AGG ACG TTG 
gin ala cys gin arg leu gly gin tyr arg met pro phe ala trp ala ala arg thr leu 
1441/481 1471/491 

TTT AAA GAC ACC TCT GGA AAC CTG GAT AAA AAC GCC AGA TTT TCT GCC ATC TAC CGG CAA 
phe lys asp thr ser gly asn leu asp lys asn ala arg phe ser ala lie tyr arg gin 
1501/501 1531/511 

GAC AGC AAT AAG CTT TCA AAT GAT GAC ATG CTC AAG CTG CTC GCA GAC TTC CGG AAA CCT 
asp ser asn lys leu ser asn asp asp met leu lys leu leu ala asp phe arg lys pro 
1561/521 1591/531 

GAG AAG ATG GCC AAA CTC CCA GTG ATT TTA GGC AAT CTA GAC ATT ACA ATT GAC AGT GTT 
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5551/1851 
















GAG AGG AGA 




GAG 


TTT 


GAA 


CGA 


TGT 


CAC 


AAC ATC CGG 


CGC 


TTC 


ATG 


TTT 


GAG 


ATG 


CCC 




thr 
r 


g u 


P « 


g u 


arg 


cys 


s 


asn ile arg 


arg 


phe 


met 


phe 


glu 


met 




5581/1861 
















5611/1871 


















ACC 


GGG 


AAG 


AGG 


CAG 






GTG GAG GAG 


CAG 




AAG 






ACC 


ATC 


phe thr gin 


thr 


gly 


lys 


arg 


gin 


gly 


gly 


val glu glu 


gin 


cys 


lys 


arg 


arg 


thr 


ile 


5641/1881 
















5671/1891 
















CTG ACA GCA 


ATA 


CAC 


TGC 




CCC 


TAC 


GTG 


AAG AAG CGG 


ATC 


CCT 


GTC 


ATG 


TAC 


CAG 


CAC 


leu thr ala 


ile 


his 


cys 


phe 


pro 


tyr 


val 


lys lys arg 


ile 




val 




tyr 


gin 


his 


5701/1901 
















5731/1911 
















CAC ACT GAC 


CTG 


AAC 


CCC 


ATT 


GAG 


GTG 


GCC 


ATC GAT GAA 


ATG 


AGC 


AAG 


AAG 


GTG 


GCC 


GAG 


his thr asp 


leu 






ile 


glu 


val 




ile asp glu 






lys 


lys 


val 


ala glu 


5761/1921 
















5791/1931 
















CTC CGC CAG 


CTC 


TGC 


TCG 


TCG 


GCT 


GAA 


GTG 


GAT ATG ATC 


AAA 


CTG 


CAG 


CTC 


AAA 


CTG 


CAG 


leu arg gin 


leu 


cys 






ala 


glu 


val 


asp met ile 


lys 


leu 


gin 


leu 




leu 


gin 


5821/1941 
















5851/1951 
















GGC AGT GTG 


AGC 


GTC 


CAG 


GTC 


AAT 


GCT 


GGT 


CCG CTA GCA 


TAT 


GCC 


CGA 


GCC 


TTC 


CTC 


GAT 






val 


gin 


val 






gly 


pro leu ala 


tyr 


ala 


arg 


ala 




leu 


asp 


5881/1961 
















5911/1971 
















GAC ACC AAC 


ACA 


AAA 


AGA 


TAC 


CCT 


GAC 


AAT 


AAG GTG AAA 


CTG 


CTG 


AAG 


GAA 


GTT 


TTC 


AGG 






lys 




tyr 








lys val lys 


leu 


leu 


lys 


glu 


val 




arg 



It . 
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5941/1981 












5971/1991 












CAA TTT GTG 


GAA GCT 


TGT 


GGT 


CAA 


GCC 


TTG GCA GTG AAC 


GAG 


CGT 


CTC ATT AAA 


GAA 


GAC 


gin phe val 


glu ala 


cys 


gly 


gin 


ala 


leu ala val asn 


glu 


arg 


leu ile lys 




asp 


6001/2001 












6031/2011 












CAA CTG GAG 


TAC CAG 


GAA 


GAG 


ATG 


AAG 


GCC AAC TAC AGG 


GAG 


ATG 


GCC AAG GAG 


CTC 


TCC 


gin leu glu 


tyr gin 


glu glu 










met 


ala lys glu 


leu 


ser 


6061/2021 












6091/2031 9 












GAC ATC ATG 


CGT GAG 


CAG 


ATT 


TGC 


CCC 


CTG GAG GAG AAG 


ACA 


AGC 


GTG CTA CCA 


AAT 


TCC 


asp lie met 


arg glu 


gin 


lie 


cys 




leu glu glu lys 


thr 


ser 






ser 


6121/2041 












6151/2051 












CTG CAC ATC 


TTC AAC 


GCC 


ATC 


AGC 


GGG 


ACA CCA ACA AGC 


ACA 


GTG 


GTT CAA GGG 


TTG 


ACC 


leu his lie 


phe asn 


ala 


ile 


ser gly thr pro thr ser 


thr val val gin gly lea thr 


6181/2061 












6211/2071 












AGC TCG TCC 


TCA GTT 


GTG 


TGA 




TAC 


CTC ATG AAC CGT 


GTG 


TGG 


GGA CAT GCT 




TCA 


ser ser ser 


ser val 


val 


OPA 


















6241/2081 












6271/2091 












TGT GCA AAC 


TCA GGA 


TGA 




CCA 


GAG 


CTA ATC ACT GGT 


GTG 


GCC 


AAG CAC AGG 


AAG 


AAG 


6301/2101 












6331/2111 












CCA TGG GGA 


ATG GGA 


GAG 


AGA 


AGG 


AGC 


CTG GAC TGT GAT 


ATT 


TAA 


TAG CAG ATT 


TTA 


TAG 


6361/2121 












6391/2131 












GAG TCG GGG 


GAA GGT 


GCA 


CAT 


AGT 




TTA AAT CTC ACT 


GGC 


AAT 


ATT TAG TTT 


TCC 


TCA 


6421/2141 
























TGT CTT AAC 


AGG TAT 


ATG 


TGG 


ATA 


CTC 


TTG 













C. Mouse CLASP-3 cDNA sequence 

1/1 31/11 



GTT TCA 


GCT 


GTG 


CCT 


GAA 


GAA 


AGT 


GAA ATG GAC CCA 


CAT 




AGA 


GAT 


TGT 


ATA AGA AGT 


val ser 


ala 


val 






glu 




glu met asp pro 


his 


val 


arg 


asp 


cys 


lie arg ser 


61/21 














91/31 














TAC ACA 


GAA 


GAC 


TGG 


GCA 




GTG 


GTC AGA AAA TAC 


CAT 


AAA 


CTG 


GGA 


ACA 


GGA TTT AAC 


tyr thr 


glu 


asp 


trp 




val 


val 


val arg lys tyr 


his 


lys 


leu 


gly 


thr gly phe asn 


121/41 














151/51 














CCC AAC 


ACA 


TTG 


GAT 


AAG 


CAG 


AAA 


GAA AGG CAA AAA 


GGA 


CTG 


CCC 


AGA 


CAA 


GTC TTT GAA 


pro asn 


thr 


leu 


asp 


lys 


gin 


lys 


glu arg gin lys 


gly 


leu 




arg gin 


val phe glu 


181/61 














211/71 














TCT GAT 


GAA 


GCT 


CCC 


GAT 


GGC 


AGC 


AGC TAC CAG GAT 


GAG 


CAA 


GAT 


GAT 


CTT 


AAA AGA CGT 


ser asp 


glu 


ala 


pro 


asp 


gly 


ser ser tyr gin asp glu 




asp 


asp 


leu 


lys arg arg 


241/81 














271/91 














TCG ATG 


TCA 


ATA 


GAT 


GAT 


ACT 


CCA 


AGG GGT AGC TGG 


GCC 


TGC 


AGT 


ATC 




GAC TTA AAA 






ile 


asp 


asp 


thr 


pro arg gly ser trp ala 


cys 




ile phe 


asp leu lys 


301/101 














331/111 














AAT TCA 


CTT 


CCT 


GAT 


GCC 


TTG 


CTT 


CCT AAT TTA CTT 


GAT 


CGA 


ACT 


CCA 


AAT 


GAA GAA ATA 




leu 




asp 


ala 


leu 


leu 


pro asn leu leu 


asp 


arg 


thr 






glu glu ile 


361/121 














391/131 














GAC CAT 


CAG 


AAT 


GAT 


GAC 


CAA 


AGG 


AAA TCA AAC CGC 


CAC 


AAG 


GAA 


CTG 


TTT 


GCT TTG CAT 




gin 


asn 


asp 


asp 


gin 


arg 


lys ser asn arg 


his 


lys glu 


leu 




ala leu his 


421/141 














451/151 














CCA TCA 


CCA 


GAT 


GAG 


GAA 


GAA 


CCA 


ATC GAA AGG CTC 


AGT 


GTT 


CCT 


GAT 


GTG 


CCC AAA GAG 




pro 


asp 




glu 


glu 


pro 


ile glu arg leu 




val 


pro 


asp 


val 


pro lys glu 


481/161 














511/171 














CAT TTC 


GGA 


CAA 


AGA 


CTT 


CTT 


GTG 


AAA TGT TTA TCA 


CTC 


AAG 




GAG 


ATT 


GAA ATT GAG 


his phe 


gly 


gin arg 


leu 


leu 


val 


lys cys leu ser 


leu 


lys phe glu 


ile glu ile glu 



lb 

Figure/T (13 of 35) 



14/95 



541/181 


















571/191 


















CCC ATT 


TTT 


GCA 


AGT 


TTG 


GCT 


TTA 


TAT 






















He 


phe 
p e 


ala 
a a 


ser 
S 


le 
eu 


a a 


eu 




asp 




glu 


lys 


lys 


lys 


ile 


ser 


glu 


asn 


601/201 


















fill /? i 

5 i ? 


















TTT TAT 


TTC 


GAC 


CTT 


AAT 




GAG 






AAG GGA 


CTG 


TTA 


CGA 


CCG 


CAT 


GTG 


CCC 


CCT 




P e 


asp 


leu 


asn 


6 


g u 


g n 


met 


lys gly 


eu 


leu 


arg 


pro 


his 


val 


pro 


pro 


661/221 


















691/231 


















______ 












AG 


TCA 


GCC 




TCT 


ATA 


ACT 


TAT 


CCT 


TCC 


CAA 


GAT 


ala ala 


lie 


th 


r 


eu 


a a 


arg 


ser 


a a 


ile phe 




ile 


thr 


tyr 


pro 


ser 


gin 




721/241 


















751/251 
















GTC TTT 












GAA 


AAA 


GTC 


CTT CAG 


CAA 


GGA 


GAC 


ATT 


GGA 


GAG 


TGT 


GCA 




leu 


va 


e 


ys 


eu 


g u 


lys 


val 




gin 


gly 


asp 


ile 


gly 


glu 


cys 


ala 


781/261 


















811/271 


















GAA CCA 


TAT 










GAA 


GCT 


GAT 


GCC ACC 


AAG 


AAT 


AAG 


GAA 


AAA 


TTG 


GAG 


AAA 


glu pro 


tyr 


met 


ile 


phe 


lys 


glu 


ala 


asp 




lys 


asn 


lys 


glu 


lys 


leu 




lys 


841/281 


















871/291 
















TTG AAA 


AGT 




GOT 




CAA 


TTT 


TGC 


CAG 


AGA CTT 


GGG 


AAA 


TAC 


CGC 


ATG 


CCT 


TTT 


GCT 


leu lys 


ser 


g n 


a a 


asp 


gin 


phe 


cys 


gin 


arg leu 


gly 


lys 


tyr 


arg 


met 


pro 


phe ala 


901/301 


















931/311 


















TGG ACT 


GCG 


ATT 




TTA 


ATG 


AAT 


ATT 


GTT 


AGC AGT 


GCT 


GGC 


AGC 


TTA 


GAA 


AGA 


GAC 


TCC 


trp thr 


ala 


e 




eu 


met 


asn 


ile 


val 


ser ser 


ala 


gly 


ser 


leu 


glu 


arg 


asp 


ser 


961/321 


















991/331 
















ACA GAA 


GTC 


GA 




T 


A T 


GGA 


GAA 


CGG 


AAA GGT 


TCT 


TGG 


TCA 


GAG 


AGG 


AGG 


AAT 


TCT 


thr glu 


val 


glu 


e 


ser 


fc r 


gly 


glu 


arg 


lys gly 




trp 


ser 


glu 


arg 


arg 






1021/341 
















1051/351 




















GGC 








CTG 


GAG 


AGG 


ACG ACC 


AGT 


GGG 


GAT 


GAT 


GCT 


TGC 


AAC 


TTG 


ser leu 


val 


9 y 


arg 


arg 




leu 


glu 


arg 


thr thr 




gly 


asp 


asp 


ala 


cys 




leu 


1081/361 
















1111/371 
















ACC AGC 


TTC 








C 


CTC 


ACT 


GTG 


GCA AAC 


TTT 


TTT 


AAG 


CAG 


GAA 


GGA 


GAC 


CGT 






trp 


pro 


a a 


fc r 


eu 


thr 


val 


ala asn phe 


phe 


lys 


gin 


glu 


gly 


asp 


arg 


1141/381 
















1171/391 
















TTA AGT 


GAT 


GAG 


GAC 


CTG 


TAC 




TTC 


CTC 


GCT GAT 


ATG 


AGA 


AGG 


CCG 


TCT 


TCT 


GTC 


TTA 


leu ser 


asp 


g u 


asp 


leu 


yr 


ys 


p e 


eu 


ala asp 


met 


arg 


arg 


pro 


ser 


ser 


val 


leu 


1201/401 
















1231/411 
















CGG AGG 


CTG 


AGA 


CCA 


ATT 


ACA 


GCT 




CTG 


AAG ATA 


GAC 




TCT 








GAA AAT 


arg arg 


leu 


ar 




ile 


thr 


ala 
a 


g n 


le 
eu 


lys ile 


asp 


e 


ser 


pro 


a a 


pro 


glu 


asn 


1261/421 
















1291/431 
















CCC CAT 


TAC 


TGC 


CTA 


ACC 


CCA 


GAG 


CTG 


CTT 


CAG GTG 


AAG 












CGA 


GTC 


pro his 


tyr 


CVS 


leu 


thr 


pro 


g u 


leu 


leu 


gin val 


lys 


eu 




pro 


asp 


ser 


arg 


val 


1321/441 
















1351/451 
















AGG CCC 


ACC 


AGA 


GAA 


ATT 


TTA 


GAA 


TTT 


CCT 


GCC AGG 


GAT 


GTC 


TAT 


GTT 


CCA 




ACA 


ACT 


arg pro 


thr 


arg 


glu 


ile 


leu 


glu 


phe 


pro 






val 


tyr 


val 


pro 


asn 


thr 


thr 


1381/461 
















1411/471 
















TAC AGA 


AAT 




CTC 


TAC 


ATT 


TAC 


CCT 


CAA 




AAT 


TTT 


GCC 


AAC 


CGT 


CAA 


GGT 


TCT 


tyr arg 




leu 


leu 


tyr 


ile 


tyr 




gin 


ser leu 






ala 




arg 


gin 


gly 


ser 


1441/483 


L 
















1471/491 
















GCC AGA 


AAC 


ATC 


ACA 


GTG 


AAA 


GTC 


CAG 




ATG TAT 


GGG 


GAA 


GAC 


CCG 


AGC 


AAC 


GCA 


ATG 


ala arg 




ile 


thr 


val 


lys 


val 


gin 


phe 


met tyr 


gly 




asp 


pro 


ser 


asn 


ala 




1501/501 
















1531/511 
















CCG GTA 


ATC 




GGT 


AAA 


TCT 


AGC 


TGT 


TCA 


GAA TTT 


TCA 


AAG 


GAA 


GCC 


TAT 


ACA 


GCC 


GTA 


pro val 


ile 




gly 


lys 


ser 






ser 


glu phe 




lys 


glu 


ala 


tyr 


thr 


ala 


val 


1561/521 
















1591/531 
















GTA TAT 


CAT 


AAC 


AGG 


TCT 


CCT 


GAT 




CAC 


GAG GAA 


ATC 


AAA 


GTC 


AAG 


CTC 


CCC 


GCC 


ACG 


val tyr 


his 




a eg 


ser 


pro 


asp 


phe 


his 


glu glu 


ile 


lys 


val 


lys 


leu 


pro 


ala 


the 


1621/541 
















1651/551 
















CTA ACT 


GAC 


CAC 


CAC 


CAC 


TTG 


CTC 




ACT 


TTT TAT 


CAT 




AGT 


TGC 


CAG 


CAG 


AAA 


CAA 


leu thr 


asp 


his 


his 


his 


leu 


leu 




thr 


phe tyr 


his 


val 


ser 


cys 


gin 


gin 


lys 


gin 
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1681/561 
















1711/571 
















AAC ACT CCT 


CTC 


GAA 


ACC 


CCA 


GTC 


GGA 


TAT 


ACA TGG ATA 


CCA 


ATG 


CTG 


CAG 


AAT 


GGA 


CGG 


asn thr pro 


leu glu 


thr 


pro 


val 


gly 


tyr 


thr trp ile 


pro 


me 


1 u 
eu 


g n 




gly arg 


1741/581 
















1771/591 
















TTG AAG ACA 


GGG 


CAG 


TTC 


TGC 


CTG 


CCA 


GTC 


TCC CTG GAA 


AAA 


CCA 


CCG 


CAG 


GCT 


TAT 


TCA 


leu lys thr 


gly gin 


phe 


cys 


leu 


pro 


val 




ys 


pro 


pro 


- 

g n 


ala 


tyr 


ser 


1801/601 
















1831/61i g 














GTG CTT TCC 


CCC 


GAG 


GTC 


CCT 


CTA 


CCT 


GGT 


ATG AAG TGG 


GTG 


GAC 


AAC 


CAC 


AAA 


GGA 


GTA 


val leu ser 




val 


pro 


leu 


pro 


- 

g 




v 
va 


asp 


asn 


S 


lys 


gly 


val 


1861/621 
















1891/631 












TTT AAT GTT 


GAA 


GTT 


GTT 


GCT 


GTT 


TCA 


TCC 




CAA 


GAC 


CCT 


TAT 


CTA 


GAC 


AAA 




glu 


val 


val 


ala 


val 


ser 


ser 


lie his thr 


g n 


asp 


pro 




leu 


asp 


lys 


1921/641 






























TTC TTT GOT 


CTT 


GTC 


AAT 


GCT 


CTG 


GAT 


GAA 


CAC ATG TTC 


CCG 


GTC 


CGA 


ATT 


GGG 


GAC 


ATG 




leu 


val 


asn 


ala 


leu 


as 

P 


g u 




pro 


val 


arg 


ile 


gly 


asp 


met 


1981/661 
















2011/671 














TGG ATC ATG 


GAA 


AAC 


AAC 


TTA 


GAG 


AGT 


GAA 






ATT 






TTG 


AAT 


TCC 


trp lie net 


glu 


asn 


asn 


leu 




ser 


g u 


leu" lys ser 






ser 


a a 


leu 


asn 


ser 


2041/681 
















2071/691 
















TCA CAG CTG 


GAA 


CCC 


GTA 


GTC 


CGG 


TTT 


CTT 


CAT CTT CTG 


CTT 


GAT 


AAA 


CTC 


ATA CTT 


TTA 




glu 


pro 


val 


val 




phe 
p e 


leu 




leu 


asp 


lys 


eu 


ile 


leu 


leu 


2101/701 
















2131/711 1CU 












GTT GTT AGA 


CCA 


CCT 


GTG 


ATT 


GCT 


GGC 


CAA 




CTA 


GGG 




GCA 


TCT 




GAA 


val val arg 


pro 


pro 


val 


lie 


ala 


- 

Y 


g n 


ile val 


leu 


gly 


g n 


ala 




phe glu 


2161/721 
















2191/73i aSn 












GCT ATG GCG 


TCA 


ATT 


ATA 


AAT 


CGA 


CTT 


CAC 


AAA AAC TTA 






AAT 


CAT 


GAC 


CAG 


CAT 


ala met ala 


ser 


lie 


lie 


asn 


arg 


leu 


his 


lys asn leu 


g u 


g y 


asn 


his 


asp gin 


his 


2221/741 
















2251/751 














GGC AGA AAC 


AAC 


CTT 


CTC 


GCA 


TCG 


TAT 


ATC 


TAT TAT GTG 


TTC 


CGC 


CTG 


CCA 


AAT 


ACT 


TAC 


gly arg asn 


asn 


leu 


leu 


ala 


ser 


tyr 


lie 




P e 


arg 


eu 


pro 


asn 


thr 


tyr 


2281/761 
















2311/771 Val 








CCT AAT TCA 


CCG 




CCA 


GGT 


CCC 


GGG 


GGT 


TTG GGA GGA 


TCA 


GTG 


CAT 


TAT 


GCC ACA 


ATG 




pro 


ser 


pro 


gly 


pro 


gly 


- 




ser 


va 


S 




ala 


thr 


met 


2341/781 
















2371/791^ ^ 
















GCT AGG TCT 


GCA 


GTG 


AGA 


CCT 


GCG 


AGC 


CTT 


AAC TTA AAT 


CGG 


TCT 


CGA 


AGC 


CTT 


AGC 


AAC 


ala arg ser 


ala 


val 


arg 


pro 


ala 


ser 


leu 




arg 


ser 


arg 


ser 


leu 






2401/801 
















2431/81i aSn 












AGT AAC CCA 


GAT 


ATA 


TCT 


GGG 


ACT 


CCC 


ACA 


TCA CCA GAT 


GAT 


GAA 


GTT 


CGG 


TCA 


ATC 


ATT 


ser asn pro 


asp 


lie 


ser 




thr 


pro 






asp 


g u 


val 


arg 


ser 


ile 


ile 


2461/821 
















2491/831 










GGC AGT AAG 


GCT 


ATG 


GAT 


CGA 


AGT 


TGT 


AAT 


CGT ATG TCT 


TCG 


CAC 


ACA 


GAG 


ACG 


TCA 


AGT 


gly ser lys 


ala 


met 


asp 


arg 




cys 




arg met ser 


ser 


his 


thr 


glu 


thr 




ser 


2521/841 
















2551/851 














TTC TTA CAA 


ACA 


TTA 


ACG 


GGA 


CGC 


TTA 


CCA 


ACT AAA AAG 


CTT 




CAC 


GAG 


GAG 


CTG 


GCT 


phe leu gin 


thr 


leu 


thr 


gly 


arg 


leu 




thr lys lys 


leu 




his 


glu 


glu 


leu 




2581/861 
















2611/871 














TTG CAG TGG 


GTC 




TGC 


AGT 


GGC 


AGC 


GTT 


CGG GAG TCG 


GCT 


TTG 


CAG 


CAA 


GCC 


TGG 


TTC 


leu gin trp 


val 


val 


cys 




gly 




val 


arg glu ser 




leu 


gin 






trp 


phe 


2641/881 
















2671/891 
















TTT TTT GAG 


TTG 


ATG 


GTA 


AAG 


AGC 


ATG 


GTG 


CAC CAC TTG 


TAC 




AAT 


GAT 


AAG 


CTC 


GAC 


phe phe glu 


leu 




val 


lys 






val 


his his leu 


tyr 


phe 


asn 


asp 


lys 


leu 


asp 


2701/901 
















2731/911 










GCT CCG AGG 


AAA 


AGT 


CGC 




CCA 


GAA 


CGT 


TTT ATG GAT 


GAT 


ATT 


GCT 


GCC 


CTG 


GTT 


AGC 


ala pro arg 


lys 


ser 


arg 




pro 


glu 


arg 


phe met asp 


asp 


ile 


ala 


ala 


leu 


val 


ser 


2761/921 
















2791/931 














ACA ATT GCT 


GGC 


GAT 


GTC 


GTG 


TCG 


CGG 


TTT 


CAG AAG GAT 


ACA 


GAA 


ATG 


GTT 


GAG 


AGA 


CTC 


thr lie ala 


gly 


asp 


val 


val 


ser 


arg 


phe 


gin lys asp 


thr 


glu 


met 


val 


glu 


arg 


leu 



it 
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16/95 



2821/941 












AAC ACA AGC 


CTT 


GCC 


TTC 


TTT 


CTC 


asn thr ser 


leu 


ala 


phe 


phe 


leu 


2881/961 












TTT AGC CTC 


ATA 


AAG 


TCC 


TGC 


TAC 


phe ser leu 


He 


lys 


ser 


cys 


tyr 


2941/981 












CCA AGT GTC 


CTG 


GTG 


TCC 


CTG 


AGG 


pro ser val 


leu 


val 


ser 


leu 


arg 


3001/1001 












TAT GTT ACA 


TTA 


AAC 


TTA 


CCC 


TGC 


tyr val thr 


leu 


asn 


leu 


pro 


cys 


3061/1021 












GTT TCT TCT 


GCA 


ACA 


TCT 


CAG 


AGC 


val ser ser 












3121/1041 












GCA AAC ATG 


TTC 


GAA 


TTG 


TCC 


TTA 


ala asn met 


phe 


glu 


leu 


ser 


leu 


3181/1061 












CTG ACT GAG 


CTT 


GCA 


CTC 


ATT 


CTA 


leu thr glu 


leu 


ala 


leu 


ile 


leu 


3241/1081 












AAA GTC ATC 


AAT 


ATG 


GTA 


CAC 


AAC 


lys val lie 


asn 


met 


val 


his 


asn 


3301/1101 












GAC CCT CAG 


ATA 


AAA 


GCT 


CGG 


GTG 


asp pro gin 


ile 


lys 


ala 


arg 


val 


3361/1121 












GAA ACT GTA 


CCT 


CAA 


CTG 


TAT 


GAT 


glu thr val 


pro 


gin 


leu 


tyr 


asp 


3421/1141 












TGT ATA GCC 


CCT 


GAT 


GAC 


TAT 




cys He ala 


pro 


asp 


asp 


tyr 


asp 


3481/1161 












ATG GCA ATT 


GCA 


GGA 


ACA 


TCC 


GTC 


met ala ile 


ala 


gly 


thr 


ser 


val 


/I t Ql 
liol 












TCA ACG AGT 


GGC 


CGG 


CAG 


CAC 


ACC 




gly 


arg 


gin 


his 


thr 


3601/1201 














TGG 


GTT 


CTC 






cys leu leu 


trp 


val 


leu 


lys 


asn 


3661/1221 












CTC TCA GTC 


CTC 


CAG 


CTG 


AAC 


CGG 


leu ser val 


leu 


gin 


leu 




arg 


3121/1241 










GAG TAC AAA 


GGA 


AAA 


AAG 


GTA 




glu tyr lys 


gly 


lys 


lys 


val 


phe 


3781/1261 










GAC ATG AGA 


GCC 


AAG 


CTT 


GAA 


GAA 


asp met arg 


ala 


iys 


leu 


glu 


glu 


3841/1281 












GTA CGG CGG 


AGC 


CGA 


GGG 


CAG 


CTC 


val arg arg 


ser 


arg 


gly 


gin 


leu 


3901/1301 












GAA AAC CTG 


CGG 


TGG 


AGA 


AAG 


GAC 


glu asn leu 


arg 


trp 


arg 


lys 


asp 







2851/951 
















AAT 


GAC 


CTG CTG TCA 


GTT 


ATG 


GAC 


AGG 


GGG 


TTT 


GTT 


asn 


asp 


leu leu ser 


val 


met 


asp 


arg 


gly 


phe val 






2911/911 












AAA 


CAG 


GTA TCA GCA 


AAG 


CTG 


TAT 


TCA 


CTG 


CCA 


AAT 


lys 


gin 


val ser ala 


lys 


leu 


tyr 


ser 


leu 


pro 


asn 






2971/991 
















TTG 


GAC 


TTT TTG CGA 


ATC 


ATC 


TGC 


AGC 


CAT 


GAG 


CAC 


leu 


asp 


phe leu arg 


ile 


ile 


cys 


ser 


his 


glu 


his 






3031/1011 














AGC 


TTG 


CTC ACC CCG 


CCT 


GCA 


TCT 


CCA 


TCA 


CCT 


TCT 


ser 


leu 


leu thr pro 


pro 


ala 


ser 


pro 


ser 


pro 


ser 






3091/1031 














TCT 


GGA 


TTC TCC ACC 


AGT 


GTG 


CAG 


GAC 


CAG 


AAG 


ATT 


ser 


gly 


phe ser thr 


ser 


val 


gin 


asp 


gin 


lys 


ile 






3151/1051 
















CCC 


TTC 


CGC CAG CAG 


CAT 


TAC 


CTG 


GCA 


GGC 


CTC 


GTG 


pro 


phe 


arg gin gin 


his 


tyr 


leu 


ala 


gly 


leu 


val 






3211/1071 
















GAC 


CCT 


GAC GCT GAA 


GGA 


CTG 


TTT 


GGA 


TTG 


CAT 


AAG 


asp 


pro 


asp ala glu 


gly 


leu 


phe 


gly 


leu 


his 


lys 






3271/1091 














TTA 


CTG 


TCC ACT CAC 


GAC 


TCT 


GAC 


CCA 


CGG 


TAC 


TCT 


leu 


leu 


ser thr his 


asp 


ser 


asp 


pro 


arg 


tyr 


ser 






3331/1111 
















GCT 


ATG 


TTG TAC CTA 


CCT 


CTT 


ATT 


GGT 


ATT 


ATC 


ATG 


ala 


met 


leu tyr leu 


pro 


leu 


ile 


gly 


ile 


ile 


met 






3391/1131 














TTT 


ACA 


GAA TCA CAC 


AAT 


CAA 


CGA 


GGA 


AGG 


CCA 




phe 


thr 


glu ser his 


asn 


gin 


arg 


gly 


arg 




ile 






3451/1151 
















AGT 


GAG 


AGT GGG AGC 


ATG 


ATA 


AGC 


CAG 


ACG 




GCC 


ser 


glu 


ser gly ser 


met 


ile 


ser 


gin 


thr 


val 


ala 






3511/1171 
















CCT 


CAG 


CTG ACA AGA 


CCT 


GGC 


AGT 


TTT 


CTC 


CTC 


ACG 






leu thr arg 


pro 


gly 


ser 




leu 


leu 


thr 






3571/1191 
















ACC 


TTC 


TCA GCA GAG 


TCA 


AGC 


AGG 


AGC 


CTT 


TTG 


ATC 


thr 


phe 


ser ala glu 


ser 


ser 


arg 


ser 


leu 


leu 


ile 






3631/1211 
















GCA 




GAA ACG GTT 












ACA 


GAT 


ala 


asp 


glu thr val 


leu 


gin 


lys 


trp 


phe 


thr 


asp 






3691/1231 
















TTG 


TTA 


GAT CTG CTT 


TAC 


CTG 


TGT 


GTA 


TCT 


TGC 


TTT 


leu 


leu 


asp leu leu 


tyr 


leu 


cys 


val 


ser 


cys 


phe 






37S1/1251 
















GAA 


CGA 


ATG AAT AGT 


TTA 


ACT 




AAG 


AAA 


TCA 


AAA 


glu 




met asn ser 


leu 


thr 




lys 


iys 








3811/1271 
















GCC 


ATT 


CTG GGG AGC 


ATC 


GGT 


GCC 


AGG 


CAG 


GAA 


ATG 


ala 


ile 


leu gly ser 


ile 


gly 


ala 


arg 


gin 


glu 


met 






3871/1291 
















GAG 


AGG 


AGC CCA TCT 


GGA 


AGC 


GCC 




GGG 


AGC 


CAA 


glu 




ser pro ser 


gly 


ser 


ala 




gly 




gin 






3931/1311 
















ATG 


ACT 


CAC TGG CGT 


CAG 


AAC 


TCA 


GAG 


AAA 


CTC 


GAT 




thr 


his trp arg 


gin 






glu 


lys 


leu 


asp 



It 
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3961/1321 






AAG TCA AGA 


GCA 


GAG 


lys ser arg 


ala 


glu 


4021/1341 






AAC CTC ATC 


ATC 


TTA 


asn leu lie 


ile 


leu 


4081/1361 






AAA GAG AGT 


ATC 


CTG 


lys glu ser 


ile 


leu 


4141/1381 






AGT GCA GTC 


TAT 


CTG 


ser ala val 


tyr 


leu 


4201/1401 




GAG CTC TTG 


TTT 


GAG 


glu leu leu 


phe 


glu 


4261/1421 






CAT TGC AGT 


AGC 


AGC 


his cys ser 






4321/1441 






ATG AGG GAG 


AAC 


TTT 


met arg gin 


asn 


phe 


4381/1461 






TCA CTG TCC 


TCT 


TTG 


ser leu ser 




leu 


4441/1481 






CTA AAA ACT 


ATT 


TTG 


leu lys thr 


ile 


leu 


4501/1501 






GAT CAG GTC 


CAA 


GAT 


asp gin val 


gin 


asp 


4561/1521 






AAG GAG CAT 


CAG 


GAA 


lys glu his 


gin 




4621/1541 






TAC CAG ACG 


TCT 


CCA 


tyr gin thr 




pro 


4681/1561 






GAA CGA AGC 


AAT 


CAT 






his 


4741/1581 






GAG TAT CTG 


AGC 


ATG 


glu tyr leu 


ser 


met 


4801/1601 






AAT ATT TCA 


TCT 


AAT 


asn ile ser 






4861/1621 






GAA GAA GGC 


ATT 


TGC 


glu glu gly 


ile 


cys 


4921/1641 






CAA GCG GCT 


GCT 


TCC 


gin ala ala 


ala 




4981/1661 






GTA CTC ATT 


CCT 


ATT 


val leu ile 




ile 


5041/1681 






AAA CTT CAG 


GAG 


GCA 


lys leu gin 


glu 


ala 













3991/1331 


ATA 


GAA 


CAC 


GAA 


GCA 




ile 


glu 


his 


glu 


ala 


leu ile 












4051/1351 


GAC 


ACG 


CTA 


GAG 


ATC 


ATC GTC CAG 




thr 


leu 


glu 


ile 


lie val gin 














GGT 


GGT 


GTG 


CTA 


AAA 


GTG CTG CTA 


gly 


gly 


val 


leu 


















CAG 


CAC 


TGC 


TTC 




ACG CAG AGA 


gin 


his 






ala 














4231/1411 


GAA 


GAG 


ACA 


GAG 


CAG 


TGT GCG GAT 




ol 
g u 


thr 


g u 


g n 


cys ala asp 












4291/1431 


ATC 


AGT 


ACA 


ATC 




TCG CAC GCT 


ile 


ser 


thr 




arg 


ser his ala 














GAG 


ATC 


GGG 


AAT 


AAC 


TTT GCC AGA 


glu 


ile 


glv 


asn 


asn 














4411/1471 












AAT TTT AAT 


val 


alv 


thr 


er 


9 n 


a *| n ** e asn 












4471/1491 


ACA 


TAT 


GCT 


GAA 


GAA 




thr 


tyr 


ala 


alu 


g u 


asp leu glu 












4531/1511 


TTG 


GTC 


TTC 


AAT 


CTC 


CAT ATG ATC 


leu 


val 


phe 


asn 


leu 
eu 














4591/1531 


GAC 


CCA 


GAA 


ATG 


CTG 




asp 


pro 


clu 


met 


leu 


ile asp leu 












4 651/1551 


GAC 


CTG 


CGC 


CTG 


ACC 






leu 


arg 


leu 


thr 


trp leu gin 












4711/1571 


GCC 


GAA 


GCT 


GCT 


CAG 


TGC CTG GTC 


ala 


glu 


ala 


ala 


gin 














4771/1591 


CTG 


GAA 


GAT 


CGG 


AAG 




leu 


glu 


asp 


arg 


lys 


tyr leu pro 












4831/1611 


GTA 


TTG 


GAA 


GAA 


TCT 


GCT GTC TCA 


val 




glu 


glu 




ala val ser 












4891/1631 


TCT 


GGA 


AAG 


TAC 


TTC 


ACT GAA TCG 




gly 


lys 


tyr 




thr glu ser 












4951/1651 


TTC 


TCC 


ATG 


GCG 


GGC 


ATG TAC GAA 










gly 


met tyr glu 












5011/1671 


CAT 


GAA 


GCT 


AAT 


CGG 


GAT GCA AAG 


his 


glu 


ala 






asp ala lys 












5071/1691 


TTC 


AGC 


AAA 


ATC 


GTC 


CAT CAG GAT 


phe 


ser 


lys 


ile 


val 


his gin asp 



It 
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GGA 
gly 


AAC 


CTG 
leu 


GCT 
ala 


ACT 
thr 


GAA 
glu 


GCA 


ACT 
thr 


GTG 
val 


TCT 
ser 


GTA 
val 


ACG 
thr 


GAA 
glu 


TCC 


CAG 
gin 


AGC 


ATG 


GCC 
ala 


TGC 
cys 


AAC 


CAA 
gin 


GCC 
ala 


CTG 
leu 


GTC 
val 


TCA 


AAG 
lys 


TTT 


CCT 


CTG 
leu 


TGC 
cys 


CTC 
leu 


CGG 
arg 


CTT 
leu 


CTC 
leu 


CGT 
arg 


AGT 


GCC 
ala 


TCC 
ser 


CTT 
leu 


TAC 
tyr 


CTC 
leu 


CTC 
leu 


GTG 
val 


AAA 

lys 


ATG 


CAA 
gin 


GTG 
val 


ACG 
thr 


ATG 


GAA 
glu 


GAA 
glu 


TTC 
phe 


TTA 
leu 


AGA 
arg 


CGC 
arg 


TCT 


leu 


AGG 
arg 


GAA 


ACC 
thr 


ACA 
thr 


TTT 


CCT 
pro 


CTT 
leu 


TCT 


GAC 


ACT 
thr 


val 


AAG 
lys 


ATG 
met 


ATG 


TAC 
tyr 


AGA 

arg 


ATC 
ile 


GCC 
ala 


AAG 
lys 


GGC 
gly 


AAC 
asn 


ATG 


GCT 
ala 


GGC 
gly 


AAG 
lys 


CAC 
his 


TCT 
ser 


CAC 
his 


TCC 


GCA 
ala 


GCC 
ala 


CTT -GTC 


GCC 
ala 


val 


GGA 
gly 


TGC 
cys 


GTA 
val 


ACA 
thr 


TTC 
phe 


CAG 
gin 


GAT 


GAT 
asp 


GTG 
val 


GTA 
val 


TCT 


CCA 


GAT 
asp 


GGA 
gly 


CTG 
leu 


GTG 
val 


GGC 
gly 


TTA 
leu 


CTG 
leu 


GAA 
glu 


GCA 
ala 


GTT 
val 


AAT 


GAG 
glu 


val 


TAC 
tyr 


AAA 
lys 


AAG 
lys 


CTC 
leu 


TCC 


ACA 
thr 


ATT 

ile 


CAT 
his 


GGC 
gly 


GGT 
gly 


AAG 
lys 


CGG 
arg 


ATG 


TTT 


GGC 
gly 


ACC 
thr 
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5101/1701 5131/1711 

TAT TTC CGT GTT GGT TTT TAT GGA ACC AAG TTC GGG GAT TTG GAT GAA CAG GAA TTT GTT 
tyr phe arg val gly phe tyr gly thr lys phe gly asp leu asp glu gin glu phe val 
5161/1721 5191/1731 

TAC AAG GAG CCT GCA ATA ACC AAA CTC GCA GAG ATT TCT CAC AGA TTG GAG GGA TTT TAT 
tyr lys glu pro ala ile thr lys leu ala glu lie ser his arg leu glu gly phe tyr 
5221/1741 5251/1751 

GGA GAA AGA TTT GGA GAG GAT GTG CTT GAA GTA ATC AAG GAC TCT AAT CCT GTA GAC AAG 
gly glu arg phe gly glu asp val leu glu val ile lys asp ser asn pro val asp lys 
5281/1761 5311/1771 

TGC AAA TTA GAT CCA AAC AAG GCA TAT ATT CAG ATT ACC TAC GTG GAG CCC TTC TTT GAC 
cys lys leu asp pro asn lys ala tyr ile gin ile thr tyr val glu pro phe phe asp 
5341/1781 5371/1791 

ACC TAT GAG ATG AAG GAC AGA ATC ACT TAT TTT GAC AAA AAT TAT AAT CTC CGG CGT TTC 
thr tyr glu met lys asp arg lie thr tyr phe asp lys asn tyr asn leu arg arg phe 
5401/1801 5431/1811 

ATG TAC TGC ACA CCC TTC ACT TTA GAT GGC CGT GCT CAT GGG GAG CTT CAT GAA CAA TTC 
met tyr cys thr pro phe thr leu asp gly arg ala his gly glu leu his glu gin phe 
5461/1821 5491/1831 

AAA CGG AAG ACC ATT CTG ACA ACG TCT CAT GCC TTT CCT TAC ATT AAA ACA AGA GTC AAT 
lys arg lys thr ile leu thr thr ser his ala phe pro tyr ile lys thr arg val asn 
5521/1841 55S1/1851 

GTC ACT CAC AAA GAA GAG ATA ATC TTA ACA CCA ATT GAA GTT GCT ATC GAA GAC ATG CAG 
val thr his lys glu glu ile ile leu thr pro ile glu val ala ile glu asp met gin 
5581/1861 5611/1871 

AAA AAG ACT CAG GAA TTG GCA TTT GCA ACA CAT CAG GAT CCA GCA GAC CCC AAA ATG CTT 
lys lys thr gin glu leu ala phe ala thr his gin asp pro ala asp pro lys met leu 
5641/1881 5671/1891 

CAA ATG GTC CTC CAG GGG TCT GTA GGA ACA ACT GTA AAC CAG GGA CCT CTG GAG GTT GCC 
gin met val leu gin gly ser val gly thr thr val asn gin gly pro leu glu val ala 
5701/1901 5731/1911 

CAG GTC TTC CTG TCT GAA ATA CCT GGT GAC CCG AAG CTC TTC AGA CAT CAC AAC AAG CTG 
gin val phe leu ser glu ile pro gly asp pro lys leu phe arg his his asn lys leu 
5761/1921 5791/1931 

CGC CTT TGT TTC AAG GAC TTT ACT AAA AGG TGT GAG GAT GCC TTA CGA AAA AAT AAG AGC 
arg leu cys phe lys asp phe thr lys arg cys glu asp ala leu arg lys asn lys ser 
5821/1941 5851/1951 

TTA ATT GGA CCA GAT CAA AAG GAG TAT CAA AGA GAA CTG GAG AGA AAT TAC CAT CGC CTT 
leu ile gly pro asp gin lys glu tyr gin arg glu leu glu arg asn tyr his arg leu 
5881/1961 5911/1971 

AAA GAG GCT CTG CAG CCA CTG ATA AAC AGA AAA ATC CCT CAG CTA TAC AAG GCT GTG TTG 
lys glu ala leu gin pro leu ile asn arg lys ile pro gin leu tyr lys ala val leu 
5941/1981 5971/1991 

CCT GTC ACT TGC CAC AGA GAT TCC TTC AGC CGA ATG AGC CTT CGA AAA ATG GAA CTC TAA 
pro val thr cys his arg asp ser phe ser arg met ser leu arg lys met glu leu OCH 
6001/2001 6031/2011 

AAT ACA CAC TTG TTC TAT TCA TTT GAA AAG AAC CAT GTA TTC AAC ACT GAG TGT GAG GTC 



6181/2061 6211/2071 

TGG TAA CAT GCT GTC ATC TTA AAA ACA AAA GGT TTT AAT GAT TCA 

It 
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1/1 31/11 

ATG GCC GAA GTG CGC AAA TTC ACC AAG CGG CTG AGC AAG CCA GGC ACG GCA GCC GAG CTC 
Met ala glu val arg lys phe thr lys arg leu ser lys pro gly thr ala ala glu leu 
'61/21 91/31 

CGC CAG AGC GTG TCG GAG GCC GTG CGG GGC TCC GTG GTG CTG GAA AAG GCC AAG TTA GTT 
arg gin ser val ser glu ala val arg gly ser val val leu glu lys ala lys leu val 
121/41 151/51 

GAA CCC CTG GAC TAT GAA AAT GTT ATC ACT CAA CGG AAA ACC CAG ATC TAC AGT GAT CCC 
glu pro leu asp tyr glu asn val lie thr gin arg lys thr gin ile tyr ser asp pro 
181/61 211/71 

CTC AGA GAT CTG CTT ATG TTC CCC ATG GAA GAC ATA TCT ATC TCA GTG ATT GGT CGT CAG 
leu arg asp leu leu met phe pro met glu asp ile ser ile ser val ile gly arg gin 
241/81 271/91 

CGC AGA ACA GTG CAG TCT ACT GTA CCA GAG GAT GCT GAA AAG AGG GCC CAG AGT TTA TTT 
arg arg thr val gin ser thr val pro glu asp ala glu lys arg ala gin ser leu phe 
301/101 331/111 

GTC AAA GAG TGT ATT AAG ACC TAC AGC ACA GAT TGG CAT GTG GTG AAC TAT AAG TAT GAG 
val lys glu cys ile lys thr tyr ser thr asp trp his val val asn tyr lys tyr glu 
361/121 391/131 

GAT TTC TCT GGG GAC TTT CGG ATG TTG CCA TGC AAG TCT TTG AGA CCT GAA AAG ATT CCT 
asp phe ser gly asp phe arg met leu pro cys lys ser leu arg pro glu lys ile pro 
421/141 451/151 

AAT CAT GTG TTT GAG ATT GAT GAA GAT TGC GAG AAA GAT GAG GAT TCA TCT TCT TTA TGT 
asn his val phe glu ile asp glu asp cys glu lys asp glu asp ser ser ser leu cys 
481/161 511/171 

TCT CAG AAG GGT GGT GTG ATA AAA CAA GGT TGG TTG CAT AAA GCA AAT GTA AAT AGC ACC 
ser gin lys gly gly val ile lys gin gly trp leu his lys ala asn val asn ser thr 
541/181 571/191 

ATC ACA GTA ACC ATG AAG GTG TTC AAG AGA CGG TAT TTT TAT TTG ACT CAA CTT CCT GAT 
ile thr val thr met lys val phe lys arg arg tyr phe tyr leu thr gin leu pro asp 
601/201 631/211 

GGT TCA TAT ATT CTC AAT TCT TAT AAA GAT GAG AAA AAT TCC AAA GAG TCT AAA GGT TGC 
gly ser tyr ile leu asn ser tyr lys asp glu lys asn ser lys glu ser lys gly cys 
661/221 691/231 

ATC TAC TTG GAT GCC TGC ATT GAT GTT GTT CAG TGC CCC AAA ATG CGC CGT CAT GCT TTT 
ile tyr leu asp ala cys ile asp val val gin cys pro lys met arg arg his ala phe 
721/241 751/2S1 

GAA CTC AAG ATG TTA GAT AAG TAT AGC CAT TAT CTG GCA GCT GAA ACG GAG CAG GAA ATG 
glu leu lys met leu asp lys tyr ser his tyr leu ala ala glu thr glu gin glu met 
781/261 811/271 

GAG GAA TGG TTG ATA ATG TTG AAA AAG ATT ATT CAG ATC AAT ACT GAC AGT TTA GTG CAA 
glu glu trp leu ile met leu lys lys ile ile gin ile asn thr asp ser leu val gin 
841/281 871/291 

GAG AAA AAG GAC ACA GTC GAG GCA ATA CAA GAG GAG GAA ACT AGC AGC CAA GGA AAA GCC 
glu lys lys asp thr val glu ala ile gin glu glu glu thr ser ser gin gly lys ala 
901/301 931/311 

GAG AAC ATC ATG GCC AGT TTG GAA AGG AGC ATG CAT CCG GAA CTG ATG AAG TAT GGC AGA 
glu asn ile met ala ser leu glu arg ser met his pro glu leu met lys tyr gly arg 
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961/321 991/331 

GAA ACA GAA CAA CTG AAC AAA CTC AGT AGA GGA GAC GGA AGA CAG AAT CTC TTT TCT TTC 
glu thr glu gin leu asn lys leu ser arg gly asp gly arg gin asn leu phe ser phe 
1021/341 1051/351 

GAC TCA GAG GTT CAG AGA TTA GAC TTT TCA GGA ATT GAG CCT GAT GTA AAG CCA TTT GAA 
asp ser glu val gin arg leu asp phe ser gly lie glu pro asp val lys pro phe glu 
1081/361 1111/371 

GAA AAG TGC AAC AAG CGT TTC ATG GTG AAT TGC CAT GAT TTA ACT TTC AAT ATC TTG GGT 
glu lys cys asn lys arg phe met val asn cys his asp leu thr phe asn ile leu gly 
1141/381 1171/391 

CAC ATT GGA GAC AAT GCA AAA GGA CCA CCC ACA AAT GTT GAG CCC TTT TTC ATC AAC CTC 
his ile gly asp asn ala lys gly pro pro thr asn val glu pro phe phe ile asn leu 
1201/401 1231/411 

GCC TTA TTT GAT GTA AAG AAC AAT TGT AAA ATT TCA GCA GAC TTT CAT GTA GAT TTG AAC 
ala leu phe asp val lys asn asn cys lys ile ser ala asp phe his val asp leu asn 
1261/421 1291/431 

CCA CCC TCT GTC CGT GAA ATG CTG TGG GGT ACC TCG ACC CAA CTG TCC AAT GAT GGG AAT 
pro pro ser val arg glu met leu trp gly thr ser thr gin leu ser asn asp gly asn 
1321/441 1351/451 

GCA AAG GGC TTT TCA CCT GAG TCT CTC ATC CAT GGA ATT GCT GAG TCT CAG TTA TGC TAC 
ala lys gly phe ser pro glu ser leu ile his gly ile ala glu ser gin leu cys tyr 
1381/461 1411/471 

ATA AAG CAG GGA ATC TTC TCA GTG ACG AAC CCA CAT CCT GAA ATT TTC CTA GTT GTG AGA 
ile lys gin gly ile phe ser val thr asn pro his pro glu ile phe leu val val arg 
1441/481 1471/491 

ATT GAA AAG GTG CTG CAG GGA AAC ATC ACA CAC TGT GCA GAA CCC TAC ATC AAA AAC TCA 
ile glu lys val leu gin gly asn ile thr his cys ala glu pro tyr ile lys asn ser 
1501/501 1531/511 

GAT CCG ATA AAG ACA GCC CAG AAG GTA CAC AGG ACA GCT AAA CAA GTG TGT AGT CGC CTT 
asp pro ile lys thr ala gin lys val his arg thr ala lys. gin val cys ser arg leu 
1561/521 1591/531 

GGA CAA TAC AGA ATG CCA TTT GCT TGG GCA GCC AGA CCC ATT TTC AAA GAT GTT CAG GGC 
gly gin tyr arg met pro phe ala trp ala ala arg pro ile phe lys asp val gin gly 
1621/541 1651/551 

TCT CTG GAT CTG GAT GGG AGA TTT TCC CCT TTG TAT AAA CAA GAC AGT AGC AAG CTT TCA 
ser leu asp leu asp gly arg phe ser pro leu tyr lys gin asp ser ser lys leu ser 
1681/561 1711/S71 

AAT GAA GAT ATT CTC AAG TTG CTC TCA GAA TAC AAA AAG CCA GAG AAG ACA AAA TTG CAG 
asn glu asp ile leu lys leu leu ser glu tyr lys lys pro glu lys thr lys leu gin 
1741/581 1771/591 

ATT ATC CCT GGA CAG CTG AGT ATC ACA GTG GAA TGT GTT CCT GTA GAT TTA CCA AAT TGT 
lie ile pro gly gin leu ser ile thr val glu cys val pro val asp leu pro asn cys 
1801/601 1831/611 

ATT ACC TCT TCA TAT GTG CCC CTG AAG CCT TTT GAA AAG AAT TGT CAG AAT ATT ACT GTG 
ile thr ser ser tyr val pro leu lys pro phe glu lys asn cys gin asn ile thr val 
1861/621 1891/631 

GAG GTT GAA GAG TTT GTT CCA GAA ATG ACA AAA TAT TGT TAT CCA TTC ACT ATT TAC AAA 
glu val glu glu phe val pro glu met thr lys tyr cys tyr pro phe thr ile tyr lys 
1921/641 1951/651 

AAT CAT CTG TAT GTA TAT CCC TTG CAA TTA AAA TAT GAC AGC CAG AAA TCG TTT GCC AAG 
asn his leu tyr val tyr pro leu gin leu lys tyr asp ser gin lys ser phe ala lys 
1981/661 2011/671 

GCT AGG AAC ATT GCT GTC TGT GTG GAA TTC CGG GAT TCG GAT GAA AGT GAC GCG TCT GCG 
ala arg asn ile ala val cys val glu phe arg asp ser asp glu ser asp ala ser ala 
2041/681 2071/691 

CTA AAG TGC ATT TAC GGA AAA CCA GCA GGG TCT GTT TTT ACC ACA AAT GCT TAT GCT GTG 
leu lys cys ile tyr gly lys pro ala gly ser val phe thr thr asn ala tyr ala val 
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2101/701 

GTC TCG CAT CAC AAC CAA AAT CCG GAG TTC 
val ser his his asn gin asn pro giu phe 
2161/721 

CAC CTG CAC CAG AAG CAT CAC CTG CTT TTT 
his leu his gin lys his his leu leu phe 
2221/741 

ACA AAA GGA ACA ACC AAA AAG CAA GAC ACA 
thr lys gly thr thr lys lys gin asp thr 
2281/761 

CCT TTG CTG AAG GAT GGG AGG GTC ATC ACA 
pro leu leu lys asp gly arg val lie thr 
2341/781 

CTT CCT CCA GGC TAC TTG AAT GTG AAT GAT 
leu pro pro gly tyr leu asn val asn asp 
2401/801 

ATC AAG TGG GTA GAT GGT GCA AAG CCT TTG 
lie lys trp val asp gly ala lys pro leu 
2461/821 

ATT TAT ACT CAG GAT CTG CAT GTG CAC AAA 
ile tyr thr gin asp leu his val his lys 
2521/841 

GGC TCA AAA GAA GTT CCA GGG GAA CTC ATT 
gly ser lys glu val pro gly glu leu lie 
2581/861 

ATC CAA GTC ATG ATA CAG TTT CTA CCT GTA 
ile gin val met ile gin phe leu pro val 
2641/881 

AAC ATG ACC CAT GAA GAT GAC GTT CCC ATC 
asn met thr his glu asp asp val pro ile 
2701/901 

TCT AAG TGT CAT GAA GAA GGC TTG GAG AGC 
ser lys cys his glu glu gly leu glu ser 
2761/921 

AGA CCT GAA AAA CCA AGC ACT CTT CAG GCC 
arg pro glu lys pro ser thr leu gin ala 
2821/941 

ATG ATA GCA ATA TTG AAA CAG TCT GCA GAT 
met ile ala ile leu lys gin ser ala asp 
2881/961 

TCA TGG TTT TTC TTT GAA ATA ATT GCA AAG 
ser trp phe phe phe glu ile ile ala lys 
2941/981 

AAA ATT AAG CTC CCC AGG GGC CAA AGG TTT 
ly3 ile lys leu pro arg gly gin arg phe 
3001/1001 

CTG CTC CTT GCA ATA ATT CCC CAT GTG ACC 
leu leu leu ala ile ile pro his val thr 
3061/1021 

CGA AAT GGG AAT TAT AGC TTA GCC AGC TTT 
arg asn gly asn tyr ser leu ala ser phe 
3121/1041 

GGA TTT GTT TTC AAT TTG ATA AAT GAC TAT 
gly phe val phe asn leu ile asn asp tyr 
3181/1061 

GTT CTT GCT GAA TAC AAG TTT GAA TTT CTC 
val leu ala glu tyr lys phe glu phe leu 



2131/711 

TAT GAT GAG ATC AAA ATT GAG CTT CCC ATT 
tyr asp glu ile lys ile glu leu pro ile 
2191/731 

ACT TTT TAT CAT GTG AGT TGT GAA ATC AAC 
thr phe tyr his val ser cys glu ile asn 
2251/751 

GTG GAA ACT CCA GTG GGA TTT GCC TGG GTA 
val glu thr pro val gly phe ala trp val 
2311/771 

TTG GAG CAG CAG CTG CCA GTC TCA GCC AAT 
leu glu gin gin leu pro val ser ala asn 
2371/791 

GCT GAA TCA AGA AGG CAA TCA AAT GCG GAT 
ala glu ser arg arg gin ser asn ala asp 
2431/811 

CTG AAG ATT AAG ACC CAC TTA GAA TCA ACC 
leu lys ile lys thr his leu glu ser thr 
2491/831 

TTC TTC CAT CAT TGC CAG CTG ATT CAG TCG 
phe phe his his cys gin leu ile gin ser 
2551/851 

AAA TAT TTA AAG TGT TTG CAT GCC ATG GAG 
lys tyr leu lys cys leu his ala met glu 
2611/871 

ATT CTT ATG CAA CTC TTC CGA GTT CTC ACA 
ile leu met gin leu phe arg val leu thr 
2671/891 

AAC TGC ACC ATG GTT CTC TTA CAC ATT GTG 
asn cys thr met val leu leu his ile val 
2731/911 

TAC TTA AGA TCC TTC ATA AAG TAT AGC TTC 
tyr leu arg ser phe ile lys tyr ser phe 
2791/931 

CAG TTG ATA CAT GAA ACC CTG GCT ACG ACT 

gin leu ile his glu thr leu ala thr thr 
2851/951 

TTC CTA GCC ATA AAT AAA CTT CTA AAG TAC 
phe leu ala ile asn lys leu leu lys tyr 
2911/971 

TCG ATG GCC ACA TAC TTA TTG GAA GAG AAT 
ser met ala thr tyr leu leu glu glu asn 
2971/991 

CCT GAG GCG TAT CAC CAC GTC TTA CAT TCT 
pro glu ala tyr his his val leu his ser 
3031/1011 

ATT CGC TAT GCA GAG ATT CCT GAT GAG TCC 
ile arg tyr ala glu ile pro asp glu ser 
3091/1031 

CTG AAG CGC TGC TTG ACA CTG ATG GAC AGA 
leu lys arg cys leu thr leu met asp arg 
3151/1051 

ATA TCA GGA TTT AGC CCT AAG GAT CCT AAG 
ile ser gly phe ser pro lys asp pro lys 
3211/1071 

CAA ACA ATC TGC AAT CAT GAA CAT TAC ATT 
gin thr ile cys asn his glu his tyr ile 
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3241/1081 

CCT CTG AAC TTG CCA ATG GCA TTT GCA AAG 
pro leu asn leu pro met ala phe ala lys 
3301/1101 

CTT GAA TAC ACT TTA TCA GAT GAG TAT TGC 
leu glu tyr ser leu ser asp glu tyr cys 
3361/1121 

CGG GAA ACC TCC ATT GCT CTT CAA GAC AAC 
arg glu thr ser lie ala leu gin asp asn 
3421/1141 

ATA AAG AAT CTT TTG ATA AAA CAT GCA TTT 
ile lys asn leu leu ile lys his ala phe 
3481/1161 

GCC AAA ATA GCG CAG TTG TAC CTC CCA TTT 
ala lys lie ala gin leu tyr leu pro phe 
3541/1181 

CTG GCA GGC CGA GAT ACT CTG TAT TCC TGT 
leu ala gly arg asp thr leu tyr ser cys 
3601/1201 

GAG TTT CCA TGT GGC TTT GTT TCA CCT ACA 
glu phe pro cys gly phe val ser pro thr 
3661/1221 

ACA GCC TAT GGG TCA TTT CAA AAT GGT CAT 

thr ala tyr gly ser phe gin asn gly his 
3721/1241 

CTT ATT CCT GAA GGA GCA ACA GGA TTT CCA 
leu ile pro glu gly ala thr gly phe pro 
3781/1261 

CAG AGT TCT TCA AGG AGT AGT GTA TCC CAG 
gin ser ser ser arg ser ser val ser gin 
3841/1281 

AAC CTC TTG ATG TGC TAC TTA TAC ATA GTA 
asn leu leu met cys tyr leu tyr ile val 
3901/1301 

TAC TGG AAT AAA GTG TCT CCT CAG GAG CTC 
tyr trp asn lys val ser pro gin glu leu 
3961/1321 

TTG TTT CAC TTC CGA TAT ATG GGG AAA AGA 
leu phe his phe arg tyr met gly lys arg 
4021/1341 

TCC AAA CAC TTT GGA ATA GAT AGA AAA TCA 
ser lys his phe gly ile asp arg lys ser 
4081/1361 

GGG GTG ATG CAA GCT CGG CTT CAA CAT CTC 
gly val met gin ala arg leu gin his leu 
4141/1381 

CAC AGT TCT GCA ACA ACG GAA GCA GAC ATT 
his ser ser ala thr thr glu ala asp ile 
4201/1401 

GCT ACT GAA GTA TCT CTG ACA GTA CTA GAG 
ala thr glu val ser leu thr val leu glu 
4261/1421 

AAC CAA CTT TTA AAT AAT GAT GGT CAC AAC 
asn gin leu leu asn asn asp gly his asn 
4321/1441 

CTT GCT TTT CTT AAA AAT GGA CAA TCT GAA 
leu ala phe leu lys asn gly gin ser glu 



3271/1091 

CCT AAA TTA CAG AGA GTT CAA GAT TCC AAT 
pro lys leu gin arg val gin asp ser asn 
3331/1111 

AAG CAT CAC TTC TTG GTT GGC CTA CTT CTG 
lys his his phe leu val gly leu leu leu 
3391/1131 

TAT GAG ATC AGA TAC ACA GCT ATC TCA GTC 
tyr glu ile arg tyr thr ala ile ser val 
3451/1151 

GAT ACT AGA TAC CAG CAC AAG AAC CAG CAA 
asp thr arg tyr gin his lys asn gin gin 
3511/1171 

GTC GGA CTG CTC TTG GAA AAT ATC CAG CGA 
val gly leu leu leu glu asn ile gin arg 
3571/1191 

GCA GCC ATG CCT AGT TCT GCA TCT AGG GAT 
ala ala met pro ser ser ala ser arg asp 
3631/1211 

AAC AGA GCG AGC CTG GCC AGT GAC AAA GAC 
asn arg gly ser leu ala ser asp lys asp 
3691/1231 

GGA ATT AAG AGG GAA GAT TCA AGA GGT TCC 
gly ile lys arg glu asp ser arg gly ser 
3751/1251 

GAC CCT GCG AGC ACT AGT GAA AAC ACA CGA 
asp pro gly ser thr ser glu asn thr arg 
3811/1271 

TAT AAT CGT CTG GAT CAG TAT GAA ATC AGA 
tyr asn arg leu asp gin tyr glu ile arg 
3871/1291 

AAG ATG ATT. TCT GAA GAT ACT CTC TTG ACT 
lys met ile ser glu asp thr leu leu thr 
3931/1311 

ATA AAC ATT CTT GTA CTT CTA GAA GTA TGT 
ile asn ile leu val leu leu glu val cys 
3991/1331 

AAC ATA GCA AGG GTA CAT GAT GCC TGG CTA 
asn ile ala arg val his asp ala trp leu 
4051/1351 

CAA ACT ATG CCA GCT CTT CGA AAC AGA TCA 
gin thr met pro ala leu arg asn arg ser 
4111/1371 

AGC AGC TTA GAA AGC TCA TTC ACA CTT AAT 
ser ser leu glu ser ser phe thr leu asn 
4171/1391 

TTC CAC CAG GCG CTT CTG GAA GGC AAT ACT 
phe his gin ala leu leu glu gly asn thr 
4231/1411 

ACG ATC TCC TTT TTT ACC CAG TGC TTC AAG 
thr ile ser phe phe thr gin cys phe lys 
4291/1431 

CCA CTA ATG AAA AAA GTA TTT GAT ATT CAC 
pro leu met lys lys val phe asp ile his 
4351/1451 

GTG TCA CTG AAG CAT GTC TTT GCC TCA CTG 
val ser leu lys his val phe ala ser leu 
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4591/1531 
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4621/1541 
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CTA AGT GGA 
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4741/1581 
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4801/1601 
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5131/1711 
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ATT TGA TAT 




TGG 


AGT 


GTT 


GCC 


TCC 


TGA 


CAC CAA AAT 


CGT 


CAT 


GCT 


TTC ATA 


CAG 


GGT 


6361/2121 
















6391/2131 














GCT TGT ATC 


TGT 


AAA 


TAT 


TTA 


AGC 


AAC 


TCG 


AAG TGC CTG 


AAA 


AAT 


TGC 


AGC ACT 


GTG 




6421/2141 
















6451/2151 














GCT TGT ACT 






AGG 


TAA 


ATC 


TAT 


ATG 


CTG AAA AGT 


AGA 


GCT 


CAA 


AAC CAT 


TCG 




6481/2161 
















6511/2171 














AAT TGC TTA 


GTT 


ACC 


ACT 


TAC 


TAA 


TAA 


TGA 


GAA TAT GTA 


AAA 


TGT 


ATA 


AAT GGA 


AAC 


CCA 


6541/2181 
















6571/2191 














AAT AAA AGG 


TAA 


CAA 


ACT 


TAT 


TTG 


TAA 


AAA 


AAA AAA AAA 
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E. Houa* CTJLSP-5 cDNA s«qu 


























-282/1 








1 


















GGA CAC TGA CAT GGA CTG AAG 


GAG 


TAG 


AAA 


TCC 


TCA 


TAC 


AAG 


CAC 


AAG 


CCA 


CAG 


CCA 


-222/21 








192/31 


















CTC TCT CTG GCT GAT CTA GTG 


GTT 


ATC 




GA AGG 


TTA 


GGT 


TCT 


TTT 


TTG 


TTT 


CTA 


CTT 


-162/41 








132/51 


















ATT ACT TCA AAG CCA CTT CTG 


CTC 


TTC 


GAA 


ATT TTG 


TAA 


CCT 


TCC 


CCT 


GTT 


TCC 


TGT 


AGC 


-102/61 


























CGC CAT TTT GTC GCC CGT GAC 


AGT 


TTA 


CCC 


ACC GCG 


TTA 


ACT 


GTG 


TGT 


CTT 


CAG 


CCT 


CAG 


-42/81 


























CTT TAT GAG CCT GTC GAG CCA 


GTG 


GAC 


TTT 


GAA GGA 


CTC 


GTG 


ATG 


ACA 


CAC 


TTA 


AAC 


AGC 
















met 


thr 


his 


leu 


asn 


ser 


19/101 


























TTG GAT GCA GAG CTG GCC CAG 


GAG 


CTG 


GGG 


GAC CTC 


ACC 


GAT 


GAC 


GAC 


CTG 


CAT 


GTG 


GCC 


leu asp ala glu leu ala gin 


glu 


leu 


gly 


asp leu 


thr 


asp 


asp 


asp 


leu 


his 


val 


ala 


79/121 


























TTC ACA CCC AAA GAA TGT AGG 


ACT 


TTG 


CAG 


CAC TCT 


CTG 


CCA 


GAG 


GAA 


GGA 


GTT 


GAA 


CTG 


phe thr pro lys glu cys arg 


thr 


leu 


g n 




leu 


pro 


glu 


glu 


gly 


val 


glu 


leu 


139/141 








169/151 












GAT CCT CAC GTC AGA GAC TGT 


GTT 


CAG 


ACC 


TAT ATT 


CGA 


GAG 


TGG 


CTG 


ATT 


GTA 


AAC 


CGG 


asp pro his val arg asp cys 


val 


gin 


thr 


tyr ile 


arg 


glu 


trp 


leu 


ile 


val 


asn 


arg 


199/161 
























AAA AAC CAA GGA AGT TCA GAG 


TTT 


TGC 


AGC 


TTT AAA 


AAG 


ACG 


GGA 


TCT 


CGC 


AGA 


GAT 


TTT 


lys asn gin gly ser ser glu 


phe 


cys 


ser 




lys 


thr 


gly 


ser 


arg 


arg 


asp 


phe 


259/181 








289/191 
















CAG AAG ACG CTT CAG AAA CAG 


ACG 


TTT 


GAG 


TCA GAA 


ACC 


TTG 


GAG 


TGC 


AGT 


GAA 


CCG 


GAC 


gin lys thr leu gin lys gin 


thr 


phe 


g u 




thr 


leu 


glu 


cys 


ser 


glu 


pro 


asp 


319/201 


















ACT CAG ACA GGA CCC CGT CAT 


CCC 


TTA 


AAC 


GTG CTG 


TGT 


GAC 


GTG 


TCT 


GGG 


AAG 


GGC 


CCC 


thr gin thr gly pro arg his 


pro 


leu 


asn 




cys 


asp 


val 


ser 


gly 


lys 


gly 


pro 


379/221 
























CTC ACT TCT TGT GAC TTC GAC 


CTC 


CGC 




CTG CAG 


CCT 


GAT 


GAG 


CGG 


CTG 


GAA 


AAC 


CTG 


leu thr ser cys asp phe asp 


leu 


arg 


ser 




pro 


asp 


glu 


arg 


leu 


glu 


asn 


leu 


439/241 
























CTC CAG CTT GTG AGC GCT GAG 


GAC 


TTT 






AAG 


GAG 


GAG 


GCC 


CGC 


AAG 


ACC 


AAT 


leu gin leu val ser ala glu 


asp 


phe 


g u 


lys glu 


lys 


glu 


glu 


ala 


arg 


lys 


thr 


asn 


499/261 








529/271 


















CGG CCG GCT GAG CTC TTT GCC 


CTC 


TAT 


CCG 


CCC GTG 


GAT 


GAG 


GAG 


GAT 


GCT 


GTG 


GAA 


ATA 


arg pro ala glu leu phe ala 


leu 


tyr 


pro 


pro val 


asp 


glu 


glu 


asp 


ala 


val 


glu 


ile 










589/291 








CGT CCA GTA CCT GAA TGT CCT 


AAG 


GAA 


CAT 


CTG GGC 


AAC 


AGA 


ATA 


TTG 


GTC 


AAG 


GTG 


CTG 


arg pro val pro glu cys pro 


lys 


glu 


his 


leu gly 




arg 


ile 


leu 


val 


lys 


val 


leu 


619/301 








649/311 














ACT CTG AAG TTT GAG ATT GAA 


ATT 


GAA 


CCT 


CTG TTT 


GCC 


AGT 


ATT 


GCC 


CTC 


TAT 


GAC 


GTT 


thr leu lys phe glu ile glu 


ile 


glu 


pro 




ala 




ile 


ala 


leu 


tyr 




val 


679/321 








709/331 














AAA GAA AGG AAA AAG ATC TCA 


GAA 


AAT 


TTC 


CAC TGT 


GAC 


CTG 


AAC 


TCC 


GAC 


CAG 


TTC 


AAA 


lys glu arg lys lys ile ser 


glu 




phe 




asp 


leu 


asn 


ser 


asp 


gin 


phe 


lys 


739/341 








769/351 












GGG TTT CTG CGA GCT CAC ACA 


CCC 


TCG 


ATT 


GAC CCA 


TCG 


AGT 


CAG 


GCT 


AGG 


TCT 


GCC 


GTG 


gly phe leu arg ala his thr 


pro 


ser 


ile 


asp pro 


ser 


ser 


gin 


ala 


arg 


ser 


ala 


val 



Figure > (25 of 35) 



26/95 



799/361 
















829/371 
















TTC TCC GTC 


ACC 


TAC 


CCA 


TCT 


TCA 


GAC 


ATC 


TAC CTG GTT 


GTC 


AAG 


ATT 


GAA 


AAG 


GTC 


CTT 


phe ser val 


thr 


tyr 


pro 


ser 


ser 


asp 


ile 


tyr leu val 


val 


lys 


ile 


glu 


lys 


val 


leu 


859/381 
















889/391 










CAG CAA GGA 


GAG 


ATT 


GCA 


GAC 


TGT 


GCA 


GAA 


CCC TAC ATG 


ATC 


ATC 


AAA 


GAA 


AGC 


GAT 


GGT 


gin gin gly 


glu 


lie 


ala 


asp 


cys 


ala 


glu 


pro tyr met 


ile 


ile 


lys 


glu 


ser 


asp 


giy 


919/401 
















949/411 












GGA AAG AGT 


AAA 


GAA 


AAG 


GTT 


GAA 


AAA 


CTA 


AAA CTT CAA 


GCT 


GAA 


TCC 


TTC 


TGC 


CAA 


CGT 


gly lys ser 


lys 


glu 


lys 


val 


glu 


lys 


leu 


lys leu gin 


ala 


glu 


ser 


phe 


cys 


gin arg 


979/421 
















1009/431 
















TTG GGG AAA 


TAC 


CGG 


ATG 


CCC 


TTC 


GCC 


TGG 


GCC CCC ATT 


AGC 


TTA 


GCA 


AGC 


TTC 


TTC 


AAC 


leu gly lys 


tyr 


arg 


met 


pro 


phe 


ala 


trp 


ala pro ile 


ser 


leu 


ala 


ser 


phe 


phe 


asn 


1039/441 
















1069/451 












GTC TCC ACC 


CTT 


GAA 


AGG 


GAG 


AGC 


ACA 


GAT 


GTG GAG CCT 


GGG 


GTT 


GGG 


AGG 


AAC 


TCT 


GTG 


val ser thr 


leu 


glu 


arg 


glu 


ser 


thr 


asp 


val glu pro 


gly 


val 


gly 


arg 


asn 




val 


1099/461 
















1129/471 
















GGT GAG AAG 


AGG 


AGC 


TTG 


TCC 


CAA 


TCC 


AGG 


AGG CCC TCT 


GAG 


CGA 


ACC 


CTC 


TCC 


TTG 


GAG 


gly glu lys 


arg 


ser 


leu 


ser 


gin 


ser 


arg 


arg pro ser 


glu 


arg 


thr 


leu 


ser 


leu glu 


1159/481 
















1189/491 
















GAA AAT GGA 


GTT 


GGA 


TCC 


AAC 


TTC 


AAA 


GCC 


ACC ACC TTG 


GCC 


ACC 


AAC 


ATC 


TTC 


TTC 


AAA 


glu asn gly 


val 


gly 


ser 


asn 


phe 


lys 




thr thr leu 


ala 


thr 


asn 


lie 


phe 




lys 


1219/501 
















1249/511 










CAG GAG GGA 


GAT 


CGC 


CTT 


AGT 


GAT 


GAA 


GAC 


TTG TTC AAG 


TTT 


TTA 


GCT 


GAC 


TAC 


AAG 


AGA 


gin glu gly 


asp 


arg 


leu 


ser 


asp 


glu 




leu phe lys 


phe 


leu 


ala 


asp 


tyr 


lys 


arg 


1279/521 
















1309/531 














TCT TCA TCC 


CTA 


CAG 


CGA 


AGA 


GTC 


AAA 


TCC 


ATC CCA GGC 


TCA 


CTG 


AGG 


CTG 


GAG 


ATA 


TCC 


ser ser ser 


leu 


gin 


arg 


arg 


val 


lys 




ile pro gly 


ser 


leu 


arg 


leu 


glu 


ile 


ser 


1339/541 
















1369/551 
















CCA GCT CCC 


GAC 


GTG 


ATG 


AAC 


TGC 


TGC CTG 


ACG CCC GAG 


ATG 


CTG 


CCA 


GTC 


AAA 


CCT 


TTT 


pro ala pro 


asp 


val 


met 


asn 


cys 


cys 


leu 


thr pro glu 


met 


leu 


pro 


val 


lys 


pro phe 


1399/561 
















1429/571 
















CCT GAA AAT 


CGG 


ACG 


CGT 


CCA 


CAC 


AAG. GAG 


ATT TTG GAA 


TTT 


CCG 


ATC 


CGG 


GAG 


GTG 


TAC 


pro glu asn 


arg 


thr 


arg 


pro 


his 


lys glu 


ile leu glu 


phe 


pro 


ile 


arg 


glu 


val 


tyr 


1459/581 
















1489/591 












GTC CCT CAC 


ACT 


GTG 


TAC 


AGA 


AAC 


CTT 


CTG 


TAC GTA TAC 


CCC 


CAG 


CGA 


CTG 


AAC 




GCT 


val pro his 


thr 


val 


tyr 


arg 


asn 


leu 


leu 


tyr val tyr 


pro 


gin 


arg 


leu 


asn 




1519/601 
















1549/611 
















AGC AAG CTA 


GCA 


TCT 


GCC 


CGG 


AAC 


ATC 


ACA 


ATA AAG ATT 


CAG 


TTT 


ATG 


TGC 


GGA 


GAA 


GAC 


ser lys leu 


ala 


ser 


ala 


arg 


asn 


ile 


thr 


ile lys ile 


gin 


phe 


met 


cys 


gly 


glu 




1579/621 
















1609/631 
















CCC AGC AAT 


GCT 


ATG 


CCG 


GTC 


ATC 


TTT 


GGC 


AAG TCC AGT 


GGG 


CCT 


GAA 


TTT 


CTG 


CAG 


GAA 


pro ser asn 


ala 


met 


pro 


val 


ile 


phe gly 




gly 


pro 


glu 


phe 


leu 




glu 


1639/641 
















1669/651 
















GTA TAT ACA 


GCT 


ATT 


ACA 


TAC 


CAT 


AAT 


AAG 


TCT CCT GAC 


TTT 


TAC 


GAA 


GAA 


GTG 


AAA 


ATT 


val tyr thr 


ala 


ile 


thr 


tyr 


his 


asn 


lys 


ser pro asp 


phe 


tyr 


glu 


glu 


val 


lys 


ile 


1699/661 
















1729/671 
















AAG CTC CCT 


GCC 


AAG 


CTC 


ACA 


GTG 


AAT 


CAT 


CAC CTC CTC 


TTC 


ACC 


TTC 


TAC 


CAC 


ATC 


AGC 


lys leu pro 


ala 


lys 


leu 


thr 


val 




his 


his leu leu 


phe 


thr 


phe 


tyr 


his 


ile 


ser 


1759/681 
















1789/691 










TGT CAG CAG 


AAG 


CAA 


GGG 


GCC 


TCC 


GGA 


GAA 


AGC CTT CTG 


GGG 


TAC 


TCG 


TGG 


CTG 


CCG 


ATT 


cys gin gin 


lys 




gly 


ala 


ser 


gly glu 


ser leu leu 


gly 


tyr 


ser 


trp 


leu 




ile 


1819/701 
















1849/711 








CTG TTA AAC 


GAA 


CGT 


CTT 


CAA 


ACC 


GGA 


TCC 


TAC TGT CTG 


CCT 


GTT 


GCC 


TTG 


GAA 


AAA 


CTA 


leu leu asn 


glu 


arg 


leu 


gin 


thr 


gly 




tyr cys leu 




val 


ala 


leu 




lys 


leu 


1879/721 
















1909/731 














CCA CCC AAC 


TAC 


TCC 


ATA 


CAT 


TCT 


GCT 


GAG 


AAA GTC CCT 


TTA 


CAG 


AAT 


CCT 


CCC 


ATT 


AAG 


pro pro asn 


tyr 


ser 


ile 


his 


ser 




glu 


lys val pro 


leu 




asn 




pro 


ile 


lys 
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1939/741 1969/751 

TGG GTC GAG GGC CAT AAG GGA GTA TTT AAT ATT GAA GTG CAA GCT GTT TCT TCC GTC CAC 
trp val glu gly his lys gly val phe asn lie glu val gin ala val ser ser val his 
1999/761 2029/771 

ACC CAG GAT AAC CAC CTG GAG AAG TTC TTC ACC CTT TGC CAC TCC CTG GAG AGC CAG GTG 
thr gin asp asn his leu glu lys phe phe thr leu cys his ser leu glu ser gin val 
2059/781 2089/791 

AGC TTC CCT ATC CGT GTG CTG GAC CAG AAG ATC ACC GAG AGC ACG CTG GAG CAC GAG CTG 
ser phe pro ile arg val leu asp gin lys lie thr glu ser thr leu glu his glu leu 
2119/801 2149/811 

AAA CTC AGC ATC ATC TGC CTC AAC TCC TCC CGC CTG GAG CCC CTC GTG CTC TTC CTC CAC 
lys leu ser ile ile cys leu asn ser ser arg leu glu pro leu val leu phe leu his 
2179/821 2209/831 

CTG GTG CTC GAC AAG CTG TTC CAG CTT TCC GTG CAG CCC ATG GTC ATT GCT GGC CAA ACA 
leu val leu asp lys leu phe gin leu ser val gin pro met val ile ala gly gin thr 
2239/841 2269/851 

GCA AAC TTC TCC CAG TTT GCC TTC GAG TCT GTG GTG GCC ATT GCC AAT AGC CTG CAC AAC 
ala asn phe ser gin phe ala phe glu ser val val ala ile ala asn ser leu his asn 
2299/861 2329/871 

AGC AAG GAC CTG AGG AAG GAC CAG CAC GGA AGG AAC TGC CTT CTG GCC TCC TAT GTG CAC 
ser lys asp leu arg lys asp gin his gly arg asn cys leu leu ala ser tyr val his 
2359/881 2389/891 

TAC GTG TTC CGG CTG CCG GAA CTG CAC AGG GAT ACA TCC AAG TCA GGT GGC CCC ATC ACC 

tyr val phe arg leu pro glu leu his arg asp thr ser lys ser gly gly pro ile thr 
2419/901 2449/911 

GTA GTC CCC GAC CCC CGA TAC CAC ACA TAT GGA CGC ACA TCT GCC GCT GCA GTG AGT TCA 
val val pro asp pro arg tyr his thr tyr gly arg thr ser ala ala ala val ser ser 
2479/921 2509/931 

AAG CTG ATG CAG GCC CGT GTG ATG AGC AGC AGC AAC CCA GAC CTG ACT GGG TCA CAC TGT 
lys leu met gin ala arg val met ser ser ser asn pro asp leu thr gly ser his cys 
2539/941 2569/951 

GCA GCC GAT GAG GAA GTT AAG AAC A TC ATG TC T TCA AAG ATT GCC GAT CGC AAC TGC AGC 
ala ala asp glu glu val lys asn ile met ser ser lys ile ala asp arg asn cys ser 
2599/961 2629/971 

CGG ATG TCT TAC TAT TGC TCT GGC AAT AGT GAT GCG CCA GGT TCA ACT GCA GCC CCA AGA 
arg met ser tyr tyr cys ser gly asn ser asp ala pro gly ser thr ala ala pro arg 
2659/981 2689/991 

CCA GTC AGC AAA AAG CAT TTC CAT GAG GAG CTT GCC CTG CAG ATG GTG GTC AGC ACT GGA 
pro val ser lys lys his phe his glu glu leu ala leu gin met val val ser thr gly 
2719/1001 2749/1011 

GTA GTG AGA GAA ACG GTC TTC AAG TAC GCC TGG TTC TTC TTT GAG CTT CTG GTG AAA AGC 
val val arg glu thr val phe lys tyr ala trp phe phe phe glu leu leu val lys ser 
2779/1021 2809/1031 

ATG GCG CAG TAT GTC CAT AAC CTG GAT AAA CGG GAC AGT TTT CGG AGG ACT CGT TTT TCT 
met ala gin tyr val his asn leu asp lys arg asp ser phe arg arg thr arg phe ser 
2839/1041 2869/1051 

GAC CGC TTC AAA GAT GAC ATA ACT ACC ATT GTT AAT GTG GTC ACC TCG GAG ATA GCA GCC 
asp-arg phe lys asp asp ile thr thr ile val asn val val thr ser glu ile ala ala 
2899/1061 2929/1071 

CTT TTA GTG AAA CCT CAG AAG GAA AGC GAG CAG GCA GAA AAG ATC AAC ATC AGC CTT GCC 
leu leu val lys pro gin lys glu ser glu gin ala glu lys ile asn ile ser leu ala 
2959/1081 2989/1091 

TTC TTC CTG TAT GAC CTC CTG TCA ATC ATG GAC AGA GGC TTC GTG TTC AAC CTC ATC AAG 
phe phe leu tyr asp leu leu ser ile met asp arg gly phe val phe asn leu lie lys 
3019/1101 3049/1111 

CAT TAC TGC AGC CAG CTG TCA GCC AAG CTG AAT ATC CTT CCA ACG CTC ATC TCC ATG CGG 
his tyr cys ser gin leu ser ala lys leu asn ile leu pro thr leu ile ser met arg 



it 
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3079/1121 

CTG GAA TTC CTG AGG ATC CTC TGC AGC CAT 
leu glu phe leu arg ile leu cys ser his 
3139/1141 

ATG AAT ACC GAC ACC GCA CCA GCA TCT CCC 
met asn thr asp thr ala pro ala ser pro 
3199/1161 

TCC TGC TCC AGT TTC CAG GAC CAA AAG ATT 
ser cys ser ser phe gin asp gin lys ile 
32S9/1181 

CGG CAG CAG CAC TTC CTT ACA GGG CTG CTC 
arg gin gin his phe leu thr gly leu leu 
3319/1201 

GAG GGG GAT GGA ATT AGC AGA GTA CAG AGA 
glu gly asp gly lie ser arg val gin arg 
3379/1221 

TGT TCT CAC GAC CTG GAT CCA CGG TGT CGC 
cys ser his asp leu asp pro arg cys arg 
3439/1241 

CTT TAC CTG CCG TTG GTC GGC ATC ATT CTG 
leu tyr leu pro leu val gly ile ile leu 
3499/1261 

GAT GCT CGC AGT GGA AGG AGT CGT GCC AGT 
asp ala arg ser gly arg ser arg ala ser 
3559/1281 

GGA ATC AAC CAG AAT GTG GCC CTG GCC ATA 
gly lie asn gin asn val ala leu ala ile 
3619/1301 

GGA GCA ATG CTG TCT TCC TTG CCC TAT AAG 
gly ala met leu ser ser leu pro tyr lys 
3679/1321 

CGC CAC CTC ATG ATT TGC TTC CTG TGG ATC 
arg his leu met ile cys phe leu trp ile 
3739/1341 

AAG TGG ATC GCT GAC CTG CCT TCC ATG CAG 
lys trp ile ala asp leu pro ser met gin 
3799/1361 

TGT GTC TCC TGC TTT GAA TAC AAG GGA AAG 
cys val ser cys phe glu tyr lys gly lys 
3859/1381 

CTG CAG AAG TCA AGA GAC GTC AAG GCC AAG 
leu gin lys ser arg asp val lys ala lys 
3919/1401 

GCC CGT GGG GAG ATG ATG CGC CGT CGC ATT 
ala arg gly glu met met arg arg arg- ile 
3979/1421 

GAA AAT CTG AGA TGG AGG AAA GAG CAG ACA 
glu asn leu arg trp arg lys glu gin thr 
4039/1441 

AAA ACA AAG GCA GAG TTA GAT CAA GAA GCC 
lys thr lys ala glu leu asp gin glu ala 
4099/1461 

AAT TTG ATC ATC CTG GAT ATG CAG GAG AAC 
asn leu ile ile leu asp met gin glu asn 
4159/1481 

AAA GAC AGC CTG CTG GGA GGT GTC CTC CGG 
lys asp ser leu leu gly gly val leu arg 



3109/1131 

GAG CAC TAC CTC AAC TTG AAC CTC CTC TTC 
glu his tyr leu asn leu asn leu leu phe 
3169/1151 

TGC CCC TCC ATA TCC TCC CAG AAC TCG AGT 
cys pro ser ile ser ser gin asn ser ser 
3229/1171 

GCC AGC ATG TTC GAT CTG ACC CCG GAG TAC 
ala ser met phe asp leu thr pro glu tyr 
3289/1191 

TTC ACG GAG CTG GCT GTT GCC CTG GAT GCT 
phe thr glu leu ala val ala leu asp ala 
3349/1211 

AAA GCC GTG AGT GCC ATC CAC AGC CTT CTG 
lys ala val ser ala ile his ser leu leu 
3409/1231 

AAA CCG GAA GTG AAA GTC AAA ATC GCC GCC 
lys pro glu val lys val lys ile ala ala 
3469/1251 

GAC GCT CTG CCA CAG CTC TAT GAC TTT ACA 
asp ala leu pro gin leu tyr asp phe thr 
3529/1271 

GGC TCG TAT GAA GAA CAA GAT GTG GCC AAC 
gly ser tyr glu glu gin asp val ala asn 
3589/1291 

GCG GGG AAT CAC TTT AAT TTG AAG ACC AGT 
ala gly asn his phe asn leu lys thr ser 
3649/1311 

CAG TAC AAC ATG CTG AAT GCA GAC ACC ACC 
gin tyr asn met leu asn ala asp thr thr 
3709/1331 

ATG AAA AAT GCT GAT CAG AGC CTC ATC AGG 
met lys asn ala asp gin ser leu ile arg 
3769/1351 

CTC AAT AGG ATT CTG GAC CTG CTG TTC ATC 
leu asn arg ile leu asp leu leu phe ile 
3829/1371 

CAG AGT TCT GAC AAA GTC AGT AAC CAG GTC 
gin ser ser asp lys val ser asn gin val 
3889/1391 

CTG GAA GAG GCC CTG CTC CGT GGG GAA GGA 
leu glu glu ala leu leu arg gly glu gly 
3949/1411 

CCA GGG ACT GAC CGG TTT CCA GGC ATA AAT 
pro gly thr asp arg phe pro gly ile asn 
4009/1431 

CAG TGG CGG CAG GCT AAT GAG AAG CTG GAC 
gin trp arg gin ala asn glu lys leu asp 
4069/1451 

TTG ATC AGT GGC AAC CTG GCT ACA GAA GCT 
leu ile ser gly asn leu ala thr glu ala 
4129/1471 

ATC ATC CAG GCA AGC TCC GCC CTG GAC TGT 
ile ile gin ala ser ser ala leu asp cys 
4189/1491 

GTC CTG GTG AAT TCT CTG AGC TGT GAC CAG 
val leu val asn ser leu ser cys asp gin 
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4219/1501 
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4549/1611 










TTA AGG ACA 


ATT 


TTG 


GCC 


TAT 


TCA 


GAA 


GAG 


GAC ACG GCC 


ATG 


CAG 


ACA 


ACT 


CCT 


TTT 


CCC 


leu arg thr 


lie 


leu 


ala 


tyr 


ser 


glu 


glu 


asp thr ala 


met 


gin 


thr 


thr 


pro 


phe 


pro 


4579/1621 
















4609/1631 










ATG CAG GTG 


GAG 


GAA 


CTT 


CTC 


TGC 


AAT 


CTG 


AAC AGC ATT 


CTG 


TAC 


GAC 


ACA 


GTG 


AAG 


ATG 
met 


met gin val 


glu 


glu 


leu 


leu 


cys 


asn 


leu 


asn ser ile 


leu 


tyr 




thr 


val 


lys 


4639/1641 
















4669/1651 










AGG GAA TTC 


CAG 


GAA 


GAC 


CCT 


GAG 


ATG 


CTT 


ATG GAC CTC 


ATG 


TAC 


AGA 


ATT 


GCC 


AAG 


AGC 


arg glu phe 


gin 


glu 


asp 


pro 


glu 


met 


leu 


met asp leu 


met 


tyr 


arg 


ile 


ala 


lys 




4699/1661 
















4729/1671 










TAC CAG GCA 


TCG 


CCT 


GAC 


CTG 


CGG 


CTG 


ACT 


TGG CTC CAG 


AAC 


ATG 


GCA 


GAG 


AAA 


CAC 


ACT 


tyr gin ala 


ser 


pro 


asp 


leu 


arg 


leu 


thr 


trp leu gin 


asn 




ala 


glu 


lys 


his 


thr 


4759/1681 
















4789/1691 












AAG AAG AAG 


TGC 


TTC 


ACA 


GAG 


GCC 


GCC 


ATG 


TGC CTG GTG 


CAT 


GCA 


GCC 


GCC 


CTG 


GTG 


GCC 


lys lys lys 


cys 


phe 


thr 


glu 


ala 


ala 


met 


cys leu val 


his 


ala 


ala 


ala 


leu 


val 


ala 


4819/1701 
















4849/1711 














GAG TAC CTG 


AGC 


ATG 


CTG 


GAG 


GAC 


CAC 


AGC 


TAC CTG CCG 


GTG 


GGC 


AGC 


GTC 


AGC 


TTT 


CAG 


glu tyr leu 


ser 


met 


leu 


glu 


asp 


his 


ser 


tyr leu pro 


val 


gly 


ser 


val 


ser 


phe 


gin 


4879/1721 
















4909/1731 










AAT ATT TCT 


TCC 


AAT 


GTG 


CTT 


GAG 


GAG 


TCT 


GCA GTC TCT 


GAT 


GAC 


ACC 


TTG 


TCA 


CCT 


GAT 


asn lie ser 


ser 


asn 


val 


leu 


glu 


glu 


ser 


ala val ser 


asp 


asp 


thr 


leu 


ser 


pro 


asp 


4939/1741 
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5359/1881 S389/1891 

ACC AAG TTG GAT CCT AAC AAG GCC TAG ATT CAG ATC ACT TTT GTG GAG CCT TAC TTT GAT 
thr lys leu asp pro asn lys ala tyr lie gin lie thr phe val glu pro tyr phe asp 
5419/1901 5449/1911 

GAA TAT GAG ATG AAA GAC CGG GTG ACC TAC TTC GAG AAG AAT TTC AAC CTC CGG AGG TTC 
glu tyr glu met lys asp arg val thr tyr phe glu lys asn phe asn leu arg arg phe 
5479/1921 5509/1931 

ATG TAC ACC ACC CCC TTC ACC CTG GAG GGG AGA CCC CGG GGC GAG CTT CAT GAG CAA CAC 
met tyr thr thr pro phe thr leu glu gly arg pro arg gly glu leu his glu gin his 
5539/1941 5569/1951 

CGC AGA AAC ACC GTG CTC ACC ACC ATG CAC GCC TTC CCC TAC ATC AAG ACC AGG ATC CGA 
arg arg asn thr val leu thr thr met his ala phe pro tyr ile lys thr arg ile arg 
5599/1961 5629/1971 

GTC AGC CAG AAA GAG GAG TTC GTT TTG ACT CCG ATT GAG GTT GCC ATT GAA GAT ATG AAG 
val ser gin lys glu glu phe val leu thr pro ile glu val ala ile glu asp met lys 
5659/1981 5689/1991 

AAG AAG ACC CTG CAG TTA GCC GTG GCC ACT CAC CAG GAG CCC CCT GAT GCA AAG ATG CTG 
lys lys thr leu gin leu ala val ala thr his gin glu pro pro asp ala lys met leu 
5719/2001 5749/2011 

CAA ATG GTA CTG CAG GGC TCT GTA GGA GCC ACT GTA AAT CAG GGA CCA CTG GAG GTG GCC 
gin met val leu gin gly ser val gly ala thr val asn gin gly pro leu glu val ala 
5779/2021 5809/2031 

CAA GTG TTC TTG GCT GAA ATT CCA GCT GAC CCA AAG CTC TAC CGA CAT CAC AAC AAG CTG 
gin val phe leu ala glu ile pro ala asp pro lys leu tyr arg his his asn lys leu 
5839/2041 5869/2051 

AGG TTG TGC TTC AAG GAG TTC ATA ATG CGA TGC GGA GAG GCC GTG GAG AAG AAC AGG CGA 
arg leu cys phe lys glu phe ile met arg cys gly glu ala val glu lys asn arg arg 
5899/2061 5929/2071 

CTC ATC ACC GCA GAG CAG CGG GAG TAC CAG CAG GAG CTG AAG AAG AAC TAC AAC AAG CTG 
leu ile thr ala glu gin arg glu tyr gin gin glu leu lys lys asn tyr ash lys leu 
5959/2081 5989/2091 

AGA GAC AGC CTC AGG CCC ATG ATT GAG CGG AAA ATC CCA GAG CTC TAC AAG CCC ATA TTC 
arg asp ser leu arg pro met ile glu arg lys ile pro glu leu tyr lys pro ile phe 
6019/2101 6049/2111 

AGA GTT GAC AGT CAG AAG AGG GAC TCT TTC CAC AGA TCT AGT TTC AGG AAA TGT GAA ACC 
arg val asp ser gin lys arg asp ser phe his arg ser ser phe arg lys cys glu thr 
6079/2121 6109/2131 

CAG TTG TCA CAG GGC AGC TGA AAA AAG CCA CCT TTG CCG GTG ACG ACT GGG GCC CCA CTA 

gin leu ser gin gly ser OPA 

6139/2141 

CTG GGA AGG ACT CGC TGG 



T . mouse CIASP-7 cDNA Eoqufince 

1. Fragment 1 

' 1/1 

CGC CGT CGG GGG CCC CAG GAT CGA GGC TAT 
arg arg arg gly pro gin asp arg gly tyr 
61/21 

TTC CGC CCG GCC ACG TTA ACT GTC ACC AAC 
phe arg pro ala thr leu thr val thr asn 



31/11 

AGT GGG GAT GAT GCC TGC AGC TTC TCT AGC 
ser gly asp asp ala cys ser phe ser ser 
91/31 

TTT TTC AAA CAG GAG GCA GAA CGG CTC AGT 
phe phe lys gin glu ala glu arg leu ser 
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121/41 151/51 

GAT GAA GAC CTT TTC AAG TTT CTG GCC GAC ATG CGG CGA CCA TCA TCC CTG CTG AGG CGA 
asp glu asp leu phe lys phe leu ala asp met org argr pro ser ser leu leu arg arg 
181/61 211/71 

TTG CGC CCT GTG ACA GCT CAG CTC AAG CTT GAC ATC TCC CCG GCC CCC GAG AAC CTG CAC 
leu arg pro val thr ala gin leu lys leu asp ile ser pro ala pro glu asn leu his 
241/81 271/91 

TTC TGC CTG TCC CCT GAC TTA CTT CAT GTC AAG CCC TAC CCT GAC CCC AGG GGA CGG CCC 
phe cys leu ser pro asp leu leu his val lys pro tyr pro asp pro arg gly arg pro 
301/101 331/111 

ACC AAG GAG ATC CTG GAG TTC CCT GCC CGC GAG GTC TAC GCC CCC CAT TCC TGT TAC AGG 
thr lys glu ile leu glu phe pro ala arg glu val tyr ala pro his ser cys tyr arg 
361/121 391/131 

AAC CTG CTC TTC GTG TAC CCA CAC AGC CTC AAT TTC AGC AGT CGC CAG GGC TCT GTG CGC 
asn leu leu phe val tyr pro his ser leu asn phe ser ser arg gin gly ser val arg 
421/141 451/151 

AAC CTG GCT GTG CGG ATC CAG TAC ATG GCC GCT GAA GAC CAG AGC CAG GCC TTG CCG GTC 
asn leu ala val arg ile gin tyr met ala gly glu asp gin ser gin ala leu pro val 
481/161 511/171 

ATC TTT GGG AAA TCT AGC TGC AGC GAA TTC ACC CGA GAG GCC TTC ACA CCA GTG GTC TAT 
ile phe gly lys ser ser cys ser glu phe thr arg glu ala phe thr pro val val tyr 
541/181 571/191 

CAC AAC AAG TCT CCT GAA TTC TAC GAG GAA TTC AAG CTA CGA CTT CCT GCC TGC GTG ACC 
his asn lys ser pro glu phe tyr glu glu phe lys leu arg leu pro ala cys val thr 
601/201 631/211 

GAG AAC CAT CAC CTC TTT TTT ACC TTC TAC CAT GTC AGC TGC CAG CCC CGG CCA GGA ACA 
glu asn his his leu phe phe thr phe tyr his val ser cys gin pro arg pro gly thr 
661/221 691/231 

GCC TTG GAG ACG CCT GTG GGC TTC ACT TGG ATC CCT CTG TTA CAA CAT GGC CGC CTG AGG 
ala leu glu thr pro val gly phe thr trp ile pro leu leu gin his gly arg leu arg 
721/241 751/251 

ACT GGT CCC TTC TGC CTG CCC GTG TCC GTG GAC CAG CCT CCA CCC AGC TAC TCG GTC CTG 
thr gly pro phe cys leu pro val ser val asp gin pro pro pro ser tyr ser val leu 
781/261 811/271 

ACC CCA GAT GTA GCG CTG CCT GGC ATG CGC TGG GTG GAC GGC CAC AAG GGC GTG TTC AGT 
thr pro asp val ala leu pro gly met arg trp val asp gly his lys gly val phe ser 
841/281 871/291 

GTG GAG CTC ACC GCC GTG TCG TCC GTG CAC CCG CAG GAC CCC CAC TTG GAT AAG TTC TTC 
val glu leu thr ala val ser ser val his pro gin asp pro his leu asp lys phe phe 
901/301 931/311 

ACG CTG GTA CAT GTT CTA GAG GAA GGG ATC TTT CCG TTC CGA CTC AAG GAG ACA GTG CTG 
thr leu val his val leu glu glu gly ile phe pro phe arg leu lys glu thr val leu 
961/321 991/331 

AGC GAA GGC ACT ATG GAG CAG GAG TTG CGA GCC AGC CTG GCA GCC CTG CGC CTT GCC AGC 
ser glu gly thr met glu gin glu leu arg ala ser leu ala ala leu arg leu ala ser 
1021/341 1051/351 

CCA GAG CCC CTA GTA GCC TTT TCC CAC CTC GTG CTA GAC AAG CTT GTC CGC TTG GTT GTG 
pro glu pro leu val ala phe ser his leu val leu asp lys leu val arg leu val val 

,4 
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1081/361 1111/371 

CGG CCA CCC ATC ATT TGT GGC CAG ATG GTG AAC CTG GGT CGA GGG GCC TTT GAA GCT ATG 
arg pro pro ile ile cys gly gin met val asn leu gly arg gly ala phe glu ala met 
1141/381 1171/391 

GCT CAT GTA GCC AGC CTT GTG CAC CGG AAC CTG GAG GCT GTC CAA GAC TCC CGT GGC CAC 
ala his val ala ser leu val his arg asn leu glu ala val gin asp ser arg gly his 
1201/401 1231/411 

TGC CCA CTG CTG GCC TCC TAT GTC CAC TAC GCC TTT CGC CTC CCT GGT GGT GAC CTC AGC 
cys pro leu leu ala ser tyr val his tyr ala phe arg leu pro gly gly asp leu ser 
1261/421 1291/431 

CTG CCA GGC GAA GCC CCT CCA GCA ACT GTG CAG GCT GCC ACA CTG GCC CGT GGC TCT GGT 
leu pro gly glu ala pro pro ala thr val gin ala ala thr leu ala arg gly ser gly 
1321/441 1351/451 

CGC CCC GCC AGC CTG TAC CTG GCA CGG TCT AAG AGC ATC AGC AGC AGC AAC CCT GAC CTG 

arg pro ala ser leu tyr leu ala arg ser lys ser ile ser ser ser asn pro asp leu 
1381/461 1411/471 

GCT GTG GTC CCT GGC TCT GTG GAC GAT GAG GTG TCC CGC ATC CTA GCC AGC AAG GGT GTC 
ala val val pro gly ser val asp asp glu val ser arg ile leu ala ser lys gly val 
1441/481 1471/491 

GAT CGC TCA CAC TCC TGG GTG AAT TCT GCT TAT GCT CCG GGA GGC AGC AAG GCT GTG CTG 
asp arg ser his ser trp val asn ser ala tyr ala pro gly gly ser lys ala val leu 
1501/501 1531/511 

CGG CGG GTG CCC CCA TAC TGT GGG GCT GAT CCC AGA CAG CTG CTG CAC GAG GAG CTG GCT 
arg arg val pro pro tyr cys gly ala asp pro arg gin leu leu his glu glu leu ala 
1561/521 

CTG CAG TGG GTG GTG AGC GGC AGT GCG GTT 
leu gin trp val val ser gly ser ala val 



2. mouse CIASP-7 cDKA fragmant 2 

1/1 31/11 

ATG GTA AAA AGC ATG GAG CTC CAT CTG CTT CTG GGC CAG CGA CTG GAC ACT CCC CGC AAG 
Met val lys ser met glu leu his leu leu leu gly gin arg leu asp thr pro arg lys 
61/21 91/31 

CTG CGC TTC CCT GGG CGC TTC CTG GAT GAC ATT GCC GCC CTG GTG GCT TCC GTG GGC CTG 
leu arg phe pro gly arg phe leu asp asp ile ala ala leu val ala ser val gly leu 
121/41 151/51 

GAA GTC ATC ACC AGA GTC CAT AAG GAC ATG AAG CTA GCT GAA CGC CTC AAC GCT AGC CTG 
glu val ile thr arg val his lys asp met lys leu ala glu arg leu asn ala ser leu 
181/61 211/71 

GCT TTC TTC CTC AGT GAC CTC CTG TCC ATA GCG GAC CGA GGC TAC ATT TTC AGC CTG GTA 
ala phe phe leu ser asp leu leu ser ile ala asp arg gly tyr ile phe ser leu val 
241/81 271/91 

CGC GCT CAC TAC AAG CAG GTG GCC ACT CGG CTC CAG TCT GCC CCC AAC CCA ACT GCA CTG 
arg ala his tyr lys gin val ala thr arg leu gin ser ala pro asn pro thr ala leu 
301/101 331/111 

CTC ACA CTA CGA ATG GAC TTC ACC CGC ATC CTG TGT AGC CAT GAG CAC TAT GTG ACC CTT 
leu thr leu arg met asp phe thr arg ile leu cys ser his glu his tyr val thr leu 
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361/121 

AAC CTC CCC TGT TGC CCC CTG TCG CCC CCA 
asn leu pro cys cys pro leu eer pro pro 
421/141 

ACC TCC CAG AGC TCC ACC TTC TCC AGC CAG 
thr ser gin ser ser thr phe ser ser gin 
481/161 

GAG CTG AGC GGG CCC TTC AGG CAG CAG CAC 
glu leu ser gly pro phe arg gin gin his 
541/181 

GCC CTG GCC CTG GAT CCG GAA GOT GAG GGG 
ala leu ala leu asp pro glu ala glu gly 
601/201 

GCC GTG CAC AGC CTG CTG TGC AGC CAC GAT 
ala val his ser leu leu cys ser his asp 
661/221 

AAC GCC AAG GTG GCC GAG CTG TAC CTG CCC 
asn ala lys val ala glu leu tyr leu pro 



391/131 

GCA TCC CCG TCC CCC TCG GTG TCC TCC ACT 
ala ser pro ser pro ser val ser ser thr 
451/151 

GCC CCC GAT CCG AAG GTG ACC AGC ATG TTC 
ala pro asp pro lys val thr ser met phe 
511/171 

TTC CTG TCC GGA CTC CTG CTG ACA GAG CTG 
phe leu ser gly leu leu leu thr glu leu 
571/191 

GCA TCC CTT TTG CAC AAG AAG GCT ATT AGT 
ala ser leu leu his lys lys ala ile ser 
631/211 

GTT GAC TCC CGC TAC GCA GAA GCT ACC GTG 
val asp ser arg tyr ala glu ala thr val 
691/231 

CTC CTC TCC CTT 
leu leu eer leu 



3. mouse CIASP-7 cDNA fragment 3 
1/1 

GAA ATG GAA CAT GAA GCT CTG GTG GAC GGA 
glu met glu his glu ala leu val asp gly 
61/21 

CTG GAC ACG CTG GAG ACC ATC GTG CAG ACA 
leu asp thr leu glu thr ile val gin thr 
121/41 

CTG AGT GCC GTG CTG AAA GTT GTC CTC TAC 
leu ser ala val leu lys val val leu tyr 
181/61 

CTG CAG CAT GGC CTG GCC ACC CAG CGG GCC 
leu gin his gly leu ala thr gin arg ala 
241/81 

GAG GAA GAC ACG GAG CTT TGT GCC GAC CTG 
glu glu asp thr glu leu cys ala asp leu 
301/101 

CGC ATC AGC ACC ATC CGC ATG CAC GCC AGC 

arg ile ser thr ile arg met his ala ser 
361/121 

TTC GAG ATT GGC CAT AAC TTT GCC CGT GTG 
phe glu ile gly his asn phe ala arg val 
421/141 

CTT GTG GGG ACA ACT CAG AAC TTT AGT GAA 
leu val gly thr thr gin asn phe ser glu 
481/161 

CTG ACC TAC GCA GAG GAG GAC ATA GGG CTG 
leu thr tyr ala glu glu asp ile gly leu 



31/11 

AAC CTG GCG ACA GAG GCC AGC CTG GTG GTT 
asn leu ala thr glu ala ser leu val val 
91/31 

GTG ATG CTG TCC GAG GCC CGT GAG AGC ATC 
val met leu ser glu ala arg glu ser ile 
151/51 

AGT CTT GGG AGC GCC CAG AGT GCC CTG TTC 
ser leu gly ser ala gin ser ala leu phe 
211/71 

CTG GTC TCC AAG TTT CCG GAG CTG CTT TTC 
leu val ser lys phe pro glu leu leu phe 
271/91 

TGC CTG AGA CTT CTG CGA CAC TGT GGC AGC 
cys leu arg leu leu arg his cys gly ser 
331/111 

GCC TCC CTC TAC CTG CTT ATG CGC CAG AAC 

ala ser leu tyr leu leu met arg gin asn 
391/131 

AAG ATG CTG GTG ACC ATG TCT CTG TCG TCC 
lys met leu val thr met ser leu ser ser 
451/151 

GAG CAT TTG AGA AAG TCC CTC AAG ACC ATC 
glu his leu arg lys ser leu lys thr ile 
511/171 

AGG GAC AGC ACC TTC GCT GAG CAG GTC CAG 
arg asp ser thr phe ala glu gin val gin 
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541/181 571/191 

GAC CTC ATG TTC AAC CTG CAC ATG ATC CTG ACA GAC ACG GTG AAG ATG AAG GAA CAC CAG 
asp leu met phe asn leu his met ile leu thr asp thr val lys met lys glu his gin 
601/201 631/211 

GAG GAC CCT GAG ATG CTC ATG GAC CTC ATG TAC AGA ATT GOT CGG GGA TAC CAA GGC TCT 
glu asp pro glu met leu met asp leu met tyr arg ile ala arg gly tyr gin gly ser 
661/221 691/231 

CCA GAC CTG CGG CTG ACA TGG CTG CAG AAC ATG GOT GGC AAA CAT GCA GAG CTG GGC AAT 
pro asp leu arg leu thr trp leu gin asn met ala gly lys his ala glu leu gly asn 
721/241 751/251 

CAT GCA GAG GCT GCC CAG TGC ATG GTC CAC GCC GCT GCC CTG GTG GCC GAA TAC CTT GCC 
his ala glu ala ala gin cys met val his ala ala ala leu val ala glu tyr leu ala 
781/261 811/271 

CTC CTC GAG GAC AGT CGG CAC CTG CCT GTG GGC TGT GTG TCC TTC CAG AAT GTC TCA TCC 
leu leu glu asp ser arg his leu pro val gly cys val ser phe gin asn val ser ser 
841/281 871/291 

AAC GTG CTG GAG GAG TCT GCC ATC TCT GAT GAC ATC CTG TCG CCA GAT GAG GAG GGC TTC 
asn val leu glu glu ser ala ile ser asp asp ile leu ser pro asp glu glu gly phe 
901/301 931/311 

TGC TCT GGG AAG AAC TTC ACA GAA CTG GGT CTG GTG GGG CTG CTG GAG CAG GCG GCC GGC 
cys ser gly lys asn phe thr glu leu gly leu val gly leu leu glu gin ala ala gly 
961/321 991/331 

TAC TTC ACC ATG GGT GGT CTG TAT GAA GCG GTG AAC GAA GTC TAC AAA AAC CTT ATC CCC 
tyr phe thr met gly gly leu tyr glu ala val asn glu val tyr lys asn leu ile pro 
1021/341 1051/351 

ATC CTT GAA GCC CAC AGA GAC TAC AAG AAG CTG GCT GCG GTG CAC GGG AAA CTG CAG GAG 
ile leu glu ala his arg asp tyr lys lys leu ala ala val his gly lys leu gin glu 
1081/361 1111/371 

GCC TTC ACC AAG ATT ATG CAC CAG AGC TCT GGC TGG GAG CGT GTA TTT GGG ACA TAT TTC 
ala phe thr lys ile met his gin ser ser gly trp glu arg val phe gly thr tyr phe 
1141/381 1171/391 

CGA GTG GGC TTC TAT GGC ACA CGA TTT GGT GAC CTG GAT GAA CAA GAG TTT GTG TAC AAG 
arg val gly phe tyr gly thr arg phe gly asp leu asp glu gin glu phe val tyr lys 
1201/401 1231/411 

GAA CCG TCA ATC ACG AAG CTT GCA GAG ATC TCA CAC CGG CTG GAG GAG TTC TAT ACG GAA 
glu pro ser ile thr lys leu ala glu ile ser his arg leu glu glu phe tyr thr glu 
1261/421 1291/431 

AGG TTC GGG GAT GAT GTG GTA GAG ATC ATC AAA GAT TCT AAC CCA GTG GAC AAG TCC AAG 
arg phe gly asp asp val val glu ile ile lys asp ser asn pro val asp lys ser. lys 
1321/441 1351/451 

CTG GAC CCA CAG AAG GCG TAC ATA CAG ATA ACC TAT GTG GAG CCA CAT TTC GAC ACT TAT 
leu asp pro gin lys ala tyr ile gin ile thr tyr val glu pro his phe asp thr tyr 
1381/461 1411/471 

GAG CTC AAG GAT CGG GTG ACC TAC TTC GAT CGG AAC TAT GGG CTG CGG GCC TTC CTC TTC 

glu leu lys asp arg val thr tyr phe asp arg asn tyr gly leu arg ala phe leu phe 
1441/481 1471/491 

TGC ACA CCC TTC ACA CCA GAT GGA CGT GCG CAC GGA GAG TTG GCC GAA CAG CAC AAA CGC 
cys thr pro phe thr pro asp gly arg ala his gly glu leu ala glu gin his lys arg 
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1501/501 1531/511 

AAG ACG CTG CTG AGC ACG GAG CAT GCC TTT CCC TAC ATC AAG ACA CGC ATC CGA GTG TGC 
lys thr leu leu ser thr glu his ala phe pro tyr ile lys thr arg ile arg val cys 
1561/521 1591/531 

CAC CGT GAG GAG ACA GTG CTG ACA CCA GTG GAG GTG GCC ATT GAG GAC ATG CAG AAG AAG 
his arg glu glu thr val leu thr pro val glu val ala ile glu asp met gin lys lys 
1621/541 1651/551 

ACC CGG GAG CTG GCC TTT GCC ACC GAG CAG GAC CCT CCA GAT GCC AAG ATG CTG CAG ATG 
thr arg glu leu ala phe ala thr glu gin asp pro pro asp ala lys met leu gin met 
1681/561 1711/571 

GTT CTG CAG GGT TCT GTG GGA CCC ACT GTG AAC CAG GGT CCC TTG GAA GTG GCC CAG GTG 
val leu gin gly ser val gly pro thr val asn gin gly pro leu glu val ala gin val 
1741/581 1771/591 

TTT TTG TCA GAG ATC CCA GAA GAT CCC AAG CTC TTC CGA CAT CAC AAC AAA CTC CGG CTC 
phe leu ser glu ile pro glu asp pro lys leu phe arg his his asn lys leu arg leu 
1801/601 1831/611 

TGC TTC AAG GAT TTC TGC AAA AAG TGC GAG GAT GCA CTG AGA AAG AAC AAG GCC CTG ATT 
cys phe lys asp phe cys lys lys cys glu asp ala leu arg lys asn lys ala leu ile 
1861/621 1891/631 

GGC CCA GAC CAG AAG GAG TAC CAC CGG GAG CTG GAG CGT CAC TAT AGC CGC CTG CGG GAG 
gly pro asp gin lys glu tyr his arg glu leu glu arg his tyr ser arg leu arg glu 
1921/641 1951/651 

GCT CTG CAG CCT CTG CTT ACC CAA CGT CTG CCC CAG CTG CTG GCA CCA AGT TCC ACC AGC 
ala leu gin pro leu leu thr gin arg leu pro gin leu leu ala pro ser ser thr ser 
1981/661 2011/671 

CTC AGG AGC TCC ATG AAC AGA TCA AGT TTC AGG AAG GCT GAC CTC TGA CAA GGC TAA GAG 
leu arg ser ser met asn arg ser ser phe arg lys ala asp leu OPA 
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-21 

GTCGCCGTCGCCGCAGCAGCC -1 
1/1 

ATG GCC GAG CGC CGC GCC TTC GCC CAG AAG ATC AGC AGA ACG GTG GCA GCC GAA GTT AGG 
Met ala glu arg arg ala phe ala gin lys ile ser arg thr val ala ala glu val arg 
61/21 " 91/31 

AAG CAG ATC TCC GGA CAA TAT AGT GGT TCT CCC CAA CTG CTC AAA AAC CTT AAT ATT GTT 
lys gin ile ser gly gin tyr ser gly ser pro gin leu leu lys asn leu asn ile val 
121/41 151/51 

GGC AAT ATA TCC CAT CAC ACC ACA GTG CCC CTT ACC GAA GCA GTA GAT CCA GTG GAT TTG 
gly asn ile ser his his thr thr val pro leu thr glu ala val asp pro val asp leu 
181/61 211/71 

GAA GAT TAC CTC ATT ACT CAT CCT TTG GCT GTG GAT TCT GGG CCT TTA CGG GAT TTG ATT 
glu asp tyr leu ile thr his pro leu ala val asp ser gly pro leu arg asp leu ile 
241/81 271/91 

GAA TTT CCT CCA GAT GAT ATT GAA GTT GTT TAT AGT CCT CGG GAC TGC AGA ACT CTT GTT 
glu phe pro pro asp asp ile glu val val tyr ser pro arg asp cys arg thr leu val 
301/101 331/111 

TCA GCT GTA CCT GAA GAA AGT GAA ATG GAT CCA CAT GTT AGA GAC TGT ATA AGA AGT TAT 
ser ala val pro glu glu ser glu met asp pro his val arg asp cys ile arg ser tyr 
361/121 391/131 

ACA GAA GAC TGG GCA ATT GTC ATC AGA AAA TAT CAT AAA TTG GGA ACA GGA TTT AAT CCC 
thr glu asp trp ala ile val ile arg lys tyr his lys leu gly thr gly phe asn pro 
421/141 451/151 

AAT ACA TTA GAT AAA CAG AAA GAA AGG CAA AAA GGT TTG CCA AAA CAA GTT TTT GAA TCT 
asn thr leu asp lys gin lys glu arg gin lys gly leu pro lys gin val phe glu ser 
481/161 511/171 

GAT GAA GCT CCA GAT GGC AAC AGC TAC CAG GAT GAT CAA GAT GAC CTT AAA AGA CGT TCA 
asp glu ala pro asp gly asn ser tyr gin asp asp gin asp asp leu lys arg arg ser 
541/181 571/191 

ATG TCA ATA GAT GAT ACC CCA AGG GGT AGC TGG GCC TGT AGT ATC TTT GAC TTG AAA AAT 
met ser ile asp asp thr pro arg gly ser trp ala cys ser ile phe asp leu lys asn 
601/201 631/211 

TCA CTT CCT GAT GCT TTG CTT CCC AAT TTA CTT GAT CGA ACT CCA AAT GAA GAA ATA GAC 
ser leu pro asp ala leu leu pro asn leu leu asp arg thr pro asn glu glu ile asp 
661/221 691/231 

CGT CAG AAT GAT GAC CAA AGG AAA TCA AAC CGT CAC AAA GAA CTT TTT GCT TTG CAT CCA 
arg gin asn asp asp gin arg lys ser asn arg his lys glu leu phe ala leu his pro 
721/241 751/251 

TCA CCA GAT GAG GAA GAA CCA ATA GAA CGG CTT AGT GTT CCT GAT ATA CCC AAA GAA CAT 
ser pro asp glu glu glu pro ile glu arg leu ser val pro asp ile pro lys glu his 
781/261 811/271 

TTT GGT CAA AGA CTT CTT GTA AAA . TGC TTA TCA CTC AAG TTT GAA ATT GAA ATT GAA CCC 
phe gly gin arg leu leu val lys cys leu ser leu lys phe glu ile glu ile glu pro 
841/281 871/291 

ATT TTT qqx AGT TTG GCT TTA TAT GAT GTC AAG GAA AAG AAA AAG ATT TCA GAA AAC TTT 
ile phe ala ser leu ala leu tyr asp val lys glu lys lys lys ile ser glu asn phe 
901/301 931/311 

TAT TTT GAC CTT AAT TCT GAG CAG ATG AAA GGG TTG TTA CGT CCA CAT GTA CCA CCT GCT 
tyr phe asp leu asn ser glu gin met lys gly leu leu arg pro his val pro pro ala 
961/321 991/331 

GCC ATT ACT ACC CTG GCA AGA TCA GCA ATT TTT TCT ATC ACT TAT CCT TCC CAA GAT GTT 
ala ile thr thr leu ala arg ser ala ile phe ser ile thr tyr pro ser gin asp val 
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1021/341 

TTT CTT GTA ATA AAG 
phe leu val ile lys 
1081/361 

CCA TAT ATG ATT TTC 
pro tyr met ile phe 
1141/381 

AAG AGT CAA GCA GAT 
lys ser gin ala asp 
1201/401 

ACT GCA ATC CAT TTA 
thr ala ile his leu 
1261/421 

GAA GTA GAA ATC AGT 
glu val glu ile ser 
1321/441 

ATT GTT GGC AGA CGA 
ile val gly arg arg 
1381/461 

AGC TTT CGA CCA GCT 
ser phe arg pro ala 
1441/481 

AGT GAT GAA GAT CTC 
ser asp glu asp leu 
1501/501 

CGA CTA AGA CCT ATT 
arg leu arg pro ile 
1561/521 

CAT TAT TGC CTA ACT 
his tyr cys leu thr 
1621/541 

CCT ACC AGA GAA ATC 
pro thr arg glu ile 
1681/561 

AGA AAT CTT CTC TAC 
arg asn leu leu tyr 
1741/581 

AGA AAT ATA ACA GTG 
arg asn ile thr val 
1801/601 

GTA ATC TTT GGT AAA 
val ile phe gly lys 
1861/621 

TAT CAT AAC AGG TCT 
tyr his asn arg ser 
1921/641 

ACT GAC CAT CAT CAC 
thr asp his his his 
1981/661 

ACT CCT CTT GAA ACA 
thr pro leu glu thr 
2041/681 

AAG ACT GGC CAG TTT 
lys thr gly gin phe 
2101/701 

CTG TCT CCT GAG GTT 
leu ser pro glu val 
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2161/721 2191/731 

AAT GTT GAA GTT GTT GCT GTT TCG TCT ATC CAT ACA CAA GAT CCT TAT CTT GAC AAA TTT 
asn val glu val val ala val ser ser ile his thr gin asp pro tyr leu asp lys phe- 
2221/741 2251/751 

TTT GCT CTG GTC AAT GCT CTG GAT GAA CAC CTG TTC CCA GTC CGA ATT GGG GAC ATG CGA 
phe ala leu val asn ala leu asp glu his leu phe pro val arg ile gly asp met arg 
2281/761 2311/771 

ATC ATG GAA AAT AAC TTA GAA AAT GAA TTG AAG AGC AGT ATT TCA GCA CTG AAT TCA TCC 
ile met glu asn asn leu glu asn glu leu lys ser ser ile ser ala leu asn ser ser 
2341/781 2371/791 

CAG CTG GAA CCA GTG GTC CGA TTT CTT CAT CTT CTG CTA GAT AAA CTG ATA CTT TTA GTT 
gin leu glu pro val val arg phe leu his leu leu leu asp lys leu ile leu leu val 
2401/801 2431/811 

ATT AGA CCT CCT GTC ATT GCT GGC CAA ATA GTT AAC CTA GGT CAA GCA TCT TTT GAA GCC 
ile arg pro pro val ile ala gly gin ile val asn leu gly gin ala ser phe glu ala 
2461/821 2491/831 

ATG GCA TCA ATT ATA AAT CGA CTT CAC AAA AAC TTG GAA GGA AAT CAT GAC CAG CAT GGC 
met ala ser ile ile asn arg leu his lys asn leu glu gly asn his asp gin his gly 
2521/841 2551/851 

AGA AAC AGC CTT CTT GCA TCA TAT ATT CAT TAT GTT TTC CGC CTA CCA AAT ACT TAC CCT 
arg asn ser leu leu ala ser tyr ile his tyr val phe arg leu pro asn thr tyr pro 
2581/861 2611/871 

AAT TCA TCA TCA CCA GGT CCT GGG GGT TTG GGA GGA TCA GTG CAT TAT GCC ACA ATG GCT 
asn ser ser ser pro gly pro gly gly leu gly gly ser val his tyr ala thr met ala 
2641/881 2671/891 

AGA TCT GCG GTG AGA CCT GCA AGC CTT AAT TTA AAT CGT TCT CGA AGC CTT AGT AAT AGC 
arg ser ala val arg pro ala ser leu asn leu asn arg ser arg ser leu ser asn ser 
2701/901 2731/911 

AAT CCA GAT ATA TCT GGG ACT CCC ACG TCA CCA GAT GAT GAA GTT CGA TCA ATC ATC GGG 
asn pro asp ile ser gly thr pro thr ser pro asp asp glu val arg ser ile ile gly 
2761/921 2791/931 

AGT AAG GGT TTA GAT CGC TCC AAT TCC TGG GTT AAC ACT GGT GGT CCA AAA GCT GCC CCA 
ser lys gly leu asp arg ser asn ser trp val asn thr gly gly pro lys ala ala pro 
2821/941 2851/951 

TGG GGA TCC AAC CCC AGT CCA AGT GCA GAA TCA ACA CAG GCT ATG GAT CGA AGT TGT AAT 
trp gly ser asn pro ser pro ser ala glu ser thr gin ala met asp arg ser cys asn 
2881/961 2911/971 

CGT ATG TCT TCG CAC ACA GAG ACG TCA AGT TTC TTA CAA ACA TTA ACG GGA CGC TTA CCA 
arg met ser ser his thr glu thr ser ser phe leu gin thr leu thr gly arg leu pro 
2941/981 2971/991 

ACT AAA AAG CTT TTT CAC GAG GAG CTG GCT TTG CAG TGG GTT GTT TGC AGT GGC AGC GTT 
thr lys lys leu phe his glu glu leu ala leu gin trp val val cys ser gly ser val 
3001/1001 3031/1011 

CGG GAA TCA GCT TTG CAA CAA GCC TGG TTC TTT TTT GAA TTA ATG GTA AAG AGC ATG GTG 
arg glu ser ala leu gin gin ala trp phe phe phe glu leu met val lys ser met val 
3061/1021 3091/1031 

CAC CAT TTA TAC TTT AAT GAT AAA CTT GAG GCT CCA AGG AAA AGT CGT TTT CCA GAA CGT 
his his leu tyr phe asn asp lys leu glu ala pro arg lys ser arg phe pro glu arg 
3121/1041 3151/1051 

TTC ATG GAT GAC ATT GCA GCT CTT GTC AGC ACG ATT GCT AGT GAT ATA GTT TCA CGA TTT 
phe met asp asp ile ala ala leu val ser thr ile ala ser asp ile val ser arg phe 
3181/1061 3211/1071 

CAG AAG GAC ACA GAA ATG GTT GAG AGA CTC AAT ACA AGC CTT GCA TTC TTT CTC AAT GAT 
gin lys asp thr glu met val glu arg leu asn thr ser leu ala phe phe leu asn asp 
3241/1081 3271/1091 

CTG TTG TCT GTT ATG GAC AGA GGA TTT GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA CAG 
leu leu ser val met asp arg gly phe val phe ser leu ile lys ser cys tyr lys gin 
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3301/1101 

GTG TCT TCA AAG CTT TAC TCA TTA CCG AAT 
val ser ser lys leu tyr. ser leu pro asn 
3361/1121 

TTT CTA CGA ATC ATC TGC AGT CAT GAG CAC 
phe leu arg ile ile cys ser his glu his 
3421/1141 

CTT ACT CCA CCT GCA TCT CCA TCA CCT TCT 
leu thr pro pro ala ser pro ser pro ser 
3481/1161 

TTT TCT ACG AAT GTA CAA GAC CAA AAG ATT 
phe ser thr asn val gin asp gin lys ile 
3541/1181 

CGC CAA CAG CAT TAT TTG GCA GGA CTT GTG 
arg gin gin his tyr leu ala gly leu val 
3601/1201 

GAT GCT GAA GGA CTG TTT GGA TTG CAT AAG 
asp ala glu gly leu phe gly leu his lys 
3661/1221 

TCC AGT CAC GAC TCA GAC CCG CGG TAC TCT 
ser ser his asp ser asp pro arg tyr ser 
3721/1241 

TTG TAT CTA CCT CTG ATT GGT ATT ATC ATG 
leu tyr leu pro leu ile gly ile ile met 
3781/1261 

GAA ACT CAC AAT CAA CGA GGA AGA CCA ATT 
glu thr his asn gin arg gly arg pro ile 
3841/1281 

AGC GGA AGT ATG ATA AGC CAG ACC GTT GCC 
ser gly ser met ile ser gin thr val ala 
3901/1301 

CTA ACA AGG CCT GGC AGT TTC CTC CTC ACG 
leu thr arg pro gly ser phe leu leu thr 
3961/1321 

TCA GCA GAA TCA AGT CGA AGC CTT TTG ATC 
ser ala glu ser ser arg ser leu leu ile 
4021/1341 

GAA ACA GTT CTA CAG AAG TGG TTT ACA GAT 
glu thr val leu gin lys trp phe thr asp 
4081/1361 

GAT CTG CTT TAT CTC TGT GTG TCT TGC TTT 
asp leu leu tyr leu cys val ser cys phe 
4141/1381 

ATG AAT AGC TTG ACC TTT AAG AAA TCA AAA 
met asn ser leu thr phe lys lys ser lys 
4201/1401 

CTT GGG AGC ATA GGT GCC AGG CAA GAA ATG 
leu gly ser ile gly ala arg gin glu met 
4261/1421 

AGC CCA TCT GGA AGT GCC TTT GGA AGT CAA 
ser pro ser gly ser ala phe gly ser gin 
4321/1441 

CAC TGG CGT CAA AAC ACA GAG AAG CTT GAC 
■ his trp arg gin asn thr glu lys leu asp 
4381/1461 

CTG ATT GAT GGA AAC CTG GCT ACA GAA GCA 
leu ile asp gly asn leu ala thr glu ala 



3331/1111 

CCC AGT GTT CTG GTG TCC TTG AGG CTG GAT 
pro ser val leu val ser leu arg leu asp- 
3391/1131 

TAT GTT ACA TTA AAC TTA CCC TGC AGC TTA 
tyr val thr leu asn leu pro cys ser leu 
3451/1151 

GTT TCT TCT GCA ACA TCT CAG AGT TCT GGA 
val ser ser ala thr ser gin ser ser gly 
3511/1171 

GCA AAT ATG TTT GAA TTA TCC GTG CCT TTC 
ala asn met phe glu leu ser val pro phe 
3571/1191 

TTA ACA GAG CTG GCT GTC ATT TTA GAC CCT 
leu thr glu leu ala val ile leu asp pro 
3631/1211 

AAA GTC ATC AAT ATG GTA CAC AAT TTA CTC 
lys val ile asn met val his asn leu leu 
3691/1231 

GAC CCT CAG ATA AAG GCT CGA GTG GCC ATG 
asp pro gin ile lys ala arg val ala met 
3751/1251 

GAA ACT GTA CCT CAG CTG TAT GAT TTT ACA 
glu thr val pro gin leu tyr asp phe thr 
3811/1271 

TGT ATA GCC ACT GAT GAT TAT GAA AGT GAG 
cys ile ala thr asp asp tyr glu ser glu 
3871/1291 

ATG GCA ATC GCA GGG ACA TCG GTC CCT CAA 
met ala ile ala gly thr ser val pro gin 
3931/1311 

TCA ACG AGT GGC AGG CAA CAC ACT ACC TTT 
ser thr ser gly arg gin his thr thr phe 
3991/1331 

TGT CTA CTT TGG GTT CTC AAA AAT GCA GAT 
cys leu leu trp val leu lys asn ala asp 
4051/1351 

CTC TCA GTC TTG CAG CTA AAC CGG CTA TTA 
leu aer val leu gin leu asn arg leu leu 
4111/1371 

GAG TAT AAA GGG AAA AAA GTG TTT GAA CGA 
glu tyr lys gly lys lys val phe glu arg 
4171/1391 

GAC ATG AGA GCA AAG CTT GAA GAA GCT ATT 
asp met arg ala lys leu glu glu ala ile 
4231/1411 

GTA CGG CGA AGC CGA GGA CAG CTC GAG AGA 
val arg arg ser arg gly gin leu glu arg 
4291/1431 

GAA AAT TTG AGG TGG AGG AAA GAT ATG ACT 
glu asn leu arg trp arg lys asp met thr 
4351/1451 

AAA TCA AGA GCA GAG ATT GAA CAC GAA GCA 
lys ser arg ala glu ile glu his glu ala 
4411/1471 

AAC CTA ATC ATT TTA GAT ACA TTA GAG ATT 
asn leu ile ile leu asp thr leu glu ile 
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AGT 


GTG 


GAG 


asp gly his 


lys 


gly 


val 


phe 


ser 


val 


glu 


2161/721 
















GAC CCC TAC 


CTG 


GAC 


AAA 


TTC 


TTC 


ACC 


CTG 


asp pro tyr 


leu 


asp 


lys 


phe 


phe 


thr 


leu 


2221/741 
















TTC CGG CTC 


AAG 


GAC 


ACT 


GTG 


CTG 


AGC 


GAG 


phe arg leu 


lys 


asp 


thr 


val 


leu 


ser 


glu 


2281/761 
















CTT GCA GCA 


CTG 


CGC 


CTG 


GCC 


AGC 


CCC 


GAA 



GAG AAG CTA 


GAG 


AAG 


CTG 


CGC 


CTG 


GCG 


GCC 


glu lys leu 


glu 


lys 


leu 


arg 


leu 


ala 


ala 


1171/391 
















CGC ATG CCC 


TTC 


GCC 


TGG 


ACG 


GCC 


GTG 


CAC 


arg met pro 


phe 


ala 


trp 


thr 


ala 


val 


his 


1231/411 
















CTG GAC CGG 


GAC 


TCT 


GAC 


TCG 


GAG 


GGC 


GAG 


leu asp arg 


asp 


ser 


asp 


ser 


glu 


gly 


glu 


1291/431 
















CGG GGG CCC 


CAG 


GAC 


CGG 


GCG 


AGT 


AGT 


GGG 


arg gly pro 


gin 


asp 


arg 


ala 


ser 


ser 


gly 


1351/451 
















CCA GCC ACG 


CTA 


ACT 


GTC 


ACA 


AAC 


TTC 


TTT 


pro ala thr 




thr 


val 


thr 


asn 


phe 


phe 


1411/471 
















GAC CTC TTC 


AAG 


TTC 


CTG 


GCT 


GAC 


ATG 


AGG 


asp leu phe 


lys 


phe 


leu 


ala 


asp 


met 


arg 


1471/491 
















CCT GTG ACT 


GCC 


CAG 


CTC 


AAG 


ATC 


GAC 


ATT 


pro val thr 


ala 


gin 


leu 


lys 


ile 


asp 


ile 


1531/511 
















CTC TCC CCT 


GAG 


CTG 


CTT 


CAT 


ATC 


AAG 


CCC 


leu ser pro 


g 


leu 


leu 


his 


ile 


lys 


pro 


1591/531 
















GAG ATT CTG 


GAG 


TTC 


CCC 


GCC 


CGC 


GAA 


GTC 


glu ile leu 


glu 


phe 


pro 


ala 


arg 


glu 


val 


1651/551 
















CTG TAC GTG 


TAC 


CCG 


CAC 


AGC 


CTC 


AAC 


TTC 


leu tyr val 


tyr 


pro 


his 


ser 


leu 


asn 


phe 


1711/571 
















GCT GTG CGA 


GTG 


CAG 


TAC 


ATG 


ACA 


GGC 


GAG 


ala val arg 


val 


gin 


tyr 


met 


thr 


gly 


glu 


1771/591 
















GGC AAG TCC 


AGC 


TGC 


AGT 


GAA 


TTT 


ACC 


CGC 


gly lys ser 




cys 




g 


phe 


thr 


arg 


1831/611 
















AAG TCC CCC 


GAG 


TTC 


TAC 


GAG 


GAG 


TTC 


AAG 


lys ser pro 


glu 




tyr 


glu 


glu 


phe 


lys 


1891/631 
















CAT CAC CTG 


CTG 


TTC 


ACC 


TTC 


TAC 


CAT 


GTC 


his his leu 


leu 


phe 


thr 


phe 


tyr 


his 


val 


1951/651 
















GAG ACA CCC 


GTG 


GGC 


TTT 


ACT 


TGG 


ATC 


CCA 


glu thr pro 


val 


gly 


phe 


thr 


trp- 


ile 


pro 


2011/671 
















CCC TTC TGT 


CTC 


CCA 


GTG 


TCT 


GTG 


GAC 


CAG 


pro phe cys 


leu 


pro 


val 


ser 


val 


asp 


gin 


2071/691 
















GAT GTG GCG 


CTT 


CCG 


GGC 


ATG 


CGC 


TGG 


GTG 


asp val ala 


leu 


pro 


gly 


met 


arg 


trp 


val 


2131/711 
















CTC ACA GCC 


GTG 


TCC 


TCT 


GTG 


CAC 


CCC 


CAG 


leu thr ala 


val 


ser 


ser 


val 


his 


pro 


gin 


2191/731 
















GTG CAC GTC 


CTG 


GAG 


GAG 


GGA 


GCC 


TTC 


CCA 


val his val 


leu 


glu 


glu 


gly 


ala 


phe 


pro 


2251/751 
















GGC AAC GTG 


GAG 


CAG 


GAG 


CTG 


CGG 


GCC 


AGT 


gly asn val 


glu 


gin 


glu 


leu 


arg 


ala 


ser 


2311/771 
















CCC CTT GTG 


GCC 


TTC 


TCC 


CAC 


CAC 


GTG 


CTG 
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leu 


arg 


leu 


ala 


ser 


pro 


glu 


2341/781 
















GAC AAG CTC 


GTG 


CGT 


CTG 


GTC 


ATC 


AGG 


CCC 


asp lys leu 


val 


arg 


leu 


val 


ile 


arg 


pro 


2401/801 
















GGC CGT GGA 


GCC 


TTT 


GAA 


GCA 


ATG 


GCC 


CAT 


gly arg gly 




phe 


glu 


ala 


met 


ala 


his 


2461/821 
















GCA GCC CAG 


GAT 


GCC 


CGC 


GGT 


CAC 


TGC 


CCA 


ala ala gin 


asp 


ala 


arg 


gly 


his 


cys 


pro 


2521/841 
















CGC CTT CCT 


GGC 


ACT 


GAG 


CCC 


AGC 


CTC 


CCG 


arg leu pro 


gly 


thr 


glu 


pro 




leu 


pro 


2581/861 
















GCC ACA CTG 


GCC 


CGT 


GGC 


TCT 


GGT 


CGC 


CCC 


ala thr leu 




arg 


gly 




gly 


arg 


pro 


2641/881 
















ATC AGC AGC 


AGC 


AAC 


CCT 


GAC 


CTC 


GCC 


GTG 


lie set ser 






pro 


asp 


leu 


ala 


val 


'2701/901 
















CGC ATC CTG 


GCC 


AGC 


AAG 


CTG 


CTT 


CAC 


GAG 


arg ile leu 


ala 




lys 


leu 


leu 


his 


glu 


2761/921 
















AGT GCC GTA 


CGC 


GAG 


GCC 


ATC 


CTC 


CAG 


CAC 


ser ala val 


arg 


glu 


ala 


ile 


leu 


gin 


his 


2821/941 
















AGT ATG GCG 


CTG 


CAC 


CTG 


CTG 


CTT 


GGC 


CAG 


ser met ala 


leu 






leu 


leu 


gly 


gin 


2881/961 
















CCC GGA CGC 


TTC 


CTG 


GAC 


GAC 


ATC 


ACT 


GCC 


pro gly arg 


phe 


leu 


asp 


asp 


ile 


thr 




2941/981 
















ACC CGT GTC 


CAC 


AAG 


GAT 


GTG 


GAG 


CTG 


GCC 


thr arg val 


his 


lys 


asp 


val 


glu 






3001/1001 
















CTC AGT GAC 


CTT 


CTG 


TCC 


CTG 


GTG 


GAC 


CGG 


leu ser asp 


leu 




ser 


leu 


val 


asp 


arg 


3061/1021 
















TAC AAG CAG 


GTG 


GCC 


ACG 


CGG 


CTC 


CAG 


TCG 


tyr lys gin 


val 




thr 


arg 


leu 


gin 


ser 


3121/1041 
















CGC ATG GAA 


TTC 


ACC 


CGC 


ATC 


CTG 


TGC 


AGC 


arg met glu 


phe 


thr 


arg 


ile 


leu 


cys 




3181/1061 
















TGC TGC CCC 


CTG 


TCA 


CCT 


CCA 


GCC 


TCG 


CCC 


cys cys pro 






pro 










3241/1081 
















AGC TCC ACC 


TTC 


TCC 


AGC 


CAA 


GCC 


CCG 


GAC 


ser ser thr 


phe 






gin 


ala 


pro 


asp 


3301/1101 
















GGA CCA TTC 


CGG 


CAG 


CAG 


CAC 


TTC 


CTA 


GCT 


gly pro phe 


arg 


gin 


gin 


his 


phe 


leu 


ala 


3361/1121 
















CTC GAA CCT 


GAG 


GCT 


GAA 


GGG 


GCA 


TTC 


CTG 


leu glu pro 


glu 


ala 


glu 


gly 


ala 


phe 


leu 


3421/1141 
















AGC CTG CTA 


TGT 


GGC 


CAT 


GAC 


ACT 


GAC 


CCC 


ser leu leu 


cys 


gly 


his 


asp 


thr 


asp 


pro 


3481/1161 
















GTG GCC GAG 


CTG 


TAC 


CTG 


CCA 


CTG 


CTT 


TCG 


val ala glu 


leu 


tyr 


leu 


pro 


leu 


leu 


ser 



pro leu val 


ala 


phe 


ser 


his 


his 


val 


leu 


2371/791 
















CCG ATC ATC 


AGT 


GGC 


CAG 


ATT 


GTG 


AAC 


CTG 


pro ile ile 


ser 


gly 


gin 


ile 


val 


asn 


leu 


2431/811 
















GTA GTC AGC 


CTT 


GTT 


CAC 


CGG 


AGC 


CTG 


GAG 


val val ser 


leu 


val 


his 


arg 


ser 


leu 


glu 


2491/831 
















CAG CTG GCT 


GCC 


TAC 


GTC 


CAC 


TAC 


GCC 


TTT 


gin leu ala 


ala 


tyr 


val 


his 


tyr 


ala 


phe 


2551/851 
















GAT GGG GCC 


CCT 


CCA 


GTG 


ACA 


GTG 


CAG 


GCT 


asp gly ala 


pro 


pro 


val 


thr 


val 


gin 


ala 


2611/871 
















GCA AGC CTC 


TAC 


CTG 


GCG 


CGT 


TCC 


AAG 


AGC 


ala ser leu 


tyr 


leu 


ala 


arg 


ser 


lys 


ser 


2671/891 
















GCC CCT GGC 


TCT 


GTG 


GAT 


GAC 


GAG 


GTT 


TCC 


ala pro gly 




val 


asp 


asp 


glu 


val 


ser 


2731/911 
















GAG CTG GCT 


CTG 


CAG 


TGG 


GTG 


GTC 


AGC 


AGC 


glu leu ala 


leu 


gin 


trp 


val 


val 


ser 


ser 


2791/931 
















GCC TGG TTC 


TTC 


TTC 


CAG 


CTC 


ATG 


GTG 


AAG 


ala trp phe 


phe 


phe 


gin 


leu 


met 


val 


lys 


2851/951 
















CGA CTA GAC 


ACA 


CCC 


CGC 


AAG 


CTG 


CGC 


TTC 


arg leu asp 


thr 


pro 


arg 


lys 


leu 


arg 


phe 


2911/971 
















TTG GTG GGC 


TCT 


GTG 


GGC 


CTG 


GAG 


GTC 


ATC 


leu val gly 




val 






glu 


val 


ile 


2971/991 
















GAG CAC CTC 


AAC 


GCC 


AGC 


CTG 


GCT 


TTC 


TTC 


glu his leu 




ala 








P © 




3031/1011 
















GGC TTT GTC 


TTC 


AGC 


CTG 


GTC 


CGG 


GCC 


CAC 


gly phe val 


phe 




leu 


val 


arg 


ala 


his 


3091/1031 
















TCC CCT AAT 


CCA 


GCA 


GCC 


CTG 


CTG 


ACC 


CTG 


ser pro asn 


pro 


ala 


ala 


leu 


leu 


thr 


leu 


3151/1051 
















CAC GAG CAC 


TAC 


GTG 


ACC 


CTC 


AAC 


CTC 


CCC 


his glu his 


tyr 


val 


thr 


leu 


asn 


leu 


pro 


3211/1071 
















TCC CCC TCT 


GTG 


TCC 


TCC 


ACC 


ACC 


TCC 


CAG 


ser pro ser 


val 


ser 


ser 


thr 


thr 


ser 


gin 


3271/1091 
















CCC AAG GTG 


ACC 


AGC 


ATG 


TTC 


GAA 


CTG 


AGT 


pro lys val 


thr 


ser 


met 


phe 


glu 


leu 


ser 


3331/1111 
















GGG CTC CTG 


CTG 


ACG 


GAG 


CTG 


GCA 


CTG 


GCC 


gly leu leu 


leu 


thr 


glu 


leu 


ala 


leu 


ala 


3391/1131 
















TTG CAC AAG 


AAG 


GCC 


ATC 


AGT 


GCT 


GTG 


CAC 


leu his lys 


lys 


ala 


ile 


ser 


ala 


val 


his 


3451/1151 
















CGC TAC GCC 


GAG 


GCC 


ACT 


GTG 


AAG 


GCT 


CGT 


arg tyr ala 


glu 


ala 


thr 


val 


lys 


ala 


arg 


3511/1171 
















ATT GCA CGG 


GAT 


ACC 


TTG 


CCA 


CGG 


CTG 


CAT 


ile ala arg 


asp 


thr 


leu 


pro 


arg 


leu 


his 
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GAC TTT GCT 


GAG 


GGC 


CCA 


GG 




rrr 


TCA 


AGA CTG GCC 


TCA ATG 


CTT 


GAC 


TCA 


GAC 


ACA 


asp phe ala 


glu 


gly 


pro 


gly 


gin 


arg 


ser 


arg leu ala 


ser met 


leu 


asp 


ser 


asp 


thr 


3601/1201 
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GAA GGC GAA 


GGG 


G 


ATT 




rrr 




ATC 


AAC CCC TCT 


GTG GCC 


ATG 


rrr 
GCC 


ATT 


rr-v 
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GGT 
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gly 


r 
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met 
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ile 
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gly 
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GGC CCC CTA 


GCC 


CCT 


GGC 


TCC 
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CGG 


GCC 
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arg 


ala 


ser 


ile ser gin 
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3751/1251 














GCA GGC TGT 


GCC 


CTC 


TCT 


GCT 


GAG 


TCA 




CGG ACC TTG 


CTG GCG 


TGT 


GTG 


CTG 


TGG 


GTG 


ala gly cys 


ala 


leu 


ser 


ala 


glu 


ser 


se C 


arg thr leu 


leu ala 


cys 


val 


leu 


trp 


val 


3781/1261 
















3811/1271 














CTG AAA AAC 


ACC 


GAG 


CCG 


GCG 


CTC 


CTG 


CAG 


CGC TGG GCC 


ACT GAC 


CTG 


ACA 


CTC 


CCC 


CAG 


leu lys asn 


thr 


glu 


pro 


ala 


leu 


leu 


gin 


arg trp ala 


thr asp 


leu 


thr 


leu 


pro 


gin 


3841/1281 
















3871/1291 














CTG GGA CGT 


CTG 


TTG 


GAC 


TTG 


CTG 


TAC 


CTT 


TGC CTG GCT 


GCC TTT 


GAG 


TAC 


AAG 


GGG 


AAA 


leu gly arg 


leu 


leu 


asp 


leu 


leu 


tyr 


leu 


cys leu ala 


ala phe 


glu 


tyr 


lys 


gly 


lys 


3901/1301 
















3931/1311 














AAG GCC TTT 


GAA 


CGC 


ATC 


AAC 


AGC 


CTC 


ACA 


TTC AAA AAA 


TCT CTG 


GAT 


ATG 


AAG 


GCG 


CGG 


lys ala phe 


glu 


arg 


ile 


asn 


ser 


leu 


thr phe lys lys 


ser leu 


asp 


met 


lys 


ala 


arg 


3961/1321 
















3991/1331 














CTA GAG GAA 


GCC 


ATT 


CTG 


GGT 


ACC 


ATC 


GGA 


GCT CGA CAA 


GAA ATG 


GTT 


CGG 


CGA AGT 


CGT 


leu glu glu 


ala 


ile 


leu 


gly 


thr 


ile 


gly ala arg gin 


glu met val 


arg 


arg 


ser 


arg 


4021/1341 
















4051/1351 














GAG AGG AGC 


CCG 


TTT 


GGG 


AAT 


CCA 


GAG 


AAC 


GTG CGC TGG 


CGG AAG 


AGC 


GTC 


ACA 


CAC 


TGG 


glu arg ser 


pro 


phe 


gly 


asn 


pro 


glu 


asn 


val arg trp 


arg lys 


ser 


val 


thr 


his 


trp 


4081/1361 
















4111/1371 














AAG CAA ACC 


TCA 


GAC 


CGC 


GTG 


GAC 


AAG 


ACC 


AAG GAT GAA 


ATG GAA 


CAC 


GAG. 


GCC 


TTG 


GTG 


lys gin thr 


ser 


asp 


arg 


val 


asp 


lys 


thr 


lys asp glu 


met glu 


his 


glu 


ala 


leu 


val 


4141/1381 
















4171/1391 














GAA GGG AAC 


CTG 


GCA 


ACC 


GAG 


GCA 


AGC 


CTA 


GTG GTT CTG 


GAC ACA 


CTG 


GAG 


ATC 


ATC 


GTG 


glu gly asn 


leu 


ala 


thr 


glu 


ala 


ser 


leu 


val val leu 


asp thr 


leu 


glu 


ile 


ile 


val 


4201/1401 
















4231/1411 














CAG ACG GTG 


ATG 


CTT 


TCA 


GAA 


GCC 


CGG 


GAG 


AGC GTC TTG 


GGG GCA 


GTG 


CTG 


AAG 


GTT 


GTG 


gin thr val 


met 


leu 


ser 


glu 


ala 


arg 


glu 


ser val leu 


gly ala val 


leu 


lys 


val 


val 


4261/1421 
















4291/1431 














CTG TAC AGC 


CTG 


GGC 


AGT 


GCC 


CAG 


AGT 


GCC 


CTC TTC TTG 


CAG CAT 


GGC 


CTG 


GCC 


ACC 


CAG 


leu tyr ser 


leu 


gly 


ser 


ala 


gin 


ser 


ala 


leu phe leu 


gin his 


gly 


leu 


ala 


thr gln- 


4321/1441 
















4351/1451 














AGG GCC . CTT 


GTG 


TCC 


AAG 


TTC 


CCG 


GAG 


CTG 


CTG TTC GAG 


GAG GAC 


ACG 


GAG 


CTG 


TGT 


GCC 


arg ala leu 


val 


ser 


lys 


phe 


pro 


glu 


leu 


leu phe glu 


glu asp thr glu leu 


cys 


ala 


4381/1461 
















4411/1471 














GAC CTG TGC 


CTG 


AGG 


CTC 


CTA 


CGA 


CAC 


TGT 


GGC AGC CGC 


ATC AGC 


ACC 


ATC 


CGC ACG 


CAC 


asp leu cys 


leu 


arg 


leu 


leu 


arg 


his 


cys 


gly ser arg 


ile ser 


thr 


ile 


arg 


thr 


his 


4441/1481 
















4471/1491 














GCC AGC GCC 


TCG 


CTG 


TAC 


CTG 


CTC 


ATG 


CGA 


CAG AAC TTC 


GAG ATC 


GGC 


CAC 


AAC 


TTT 


GCC 
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met 
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CTG 
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TCA 


CTC 


AAA 


ACC ATC CTC 


ACC TAT 


GCT 


GAG 


GAG 
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ATG 


ser glu glu 
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arg 


arg 


ser 


leu 


lys 


thr ile leu 


thr tyr ala glu glu asp met 


4621/1541 
















4651/1551 














GGG CTG CGG 


GAC 


AGC 


ACC 


TTC 


GCA 


GAG 


CAG 


GTC CAG GAC 


CTG ATG 


TTC 


AAC 


CTG 
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ATG 


gly leu arg 


asp 


ser 
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glu 


gin 


val gin asp 


leu met phe 
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met 
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asp 
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CCCAAATGGACTCTGACCAGATTTTTGCCATACTGGGGGGTGGCGGGATGGAGGATGGGTACTCAGGCATGACTGCGTAT 
TTATTAAAGTGTGTTTTTCCACAATGTACCAAACAAGGCATAAGCAGCTTCTCCTGCTGACTGGCCAATCACTGCCCATC 
TGAGAGATGATTTCCTCTGGCCCATATTTGAATTTATTGGAGTAACTCAAATTGCCTGAGGAAAAATGGAAAAATTATCC 
ACCAGTCGATTCAAACTGAATTTCACTCTTTATAGGAAGGCAGGGCAAACTTGTAGGAGTACGAAACATTTTCAATAAAT 
CTACAAAGGGAAGCCTTACTACAATTCCAAAAATCATCATGGTTGGAAATTTGGGAGGAGATTATTTGTGAACTTGTTAC 
CCTTTTGGTAATGGTGGACTAATTGCTGTATAGTTATTTTTGTTTTATTATTACTGTTACATTAATTTAACATGCATTTA 
TAGAAGAATACATTCAAAGCACTGATGTAGGAGATACACGGTACTTGGAGCAGTCAGCCAAAAATCACAGATACTGCTTT 
CACTTAAATGGAAACAATTCTCCGATAATGCTTTGCTTTTTTTCTTATGTCACTCTTGTGTACTATCTATTTTTCTCCTC 
TCTGGGACCAAGTTTCTTTTTATAAAGCAATAATATCTCTGTTTTCATTTCAGAACATTGTGCTGTCTGTCAGCATATGT 
ATATCAGCTACAAAATATATTCAACTTTGACTTCTTTTGACAAAGGACTTTAGGAAAAGGAGGAACAAAGACATTATTTG 
AGAATTAAATTATATATTTTTAATATGACTGTGACCTTGACTGATAATAAAGATGTAATAAGAATTGCAAGCTAAAAAAA 
AAAAAAAAAAAACTCG 
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Exon 1A (- 216 to -95, ' P2 ■ ) 

CCCATTAGGAGGTCTACCCACCCTGATTGGAGGACATGGGGCTGGAGGCTGAAGATGGAGACCAGATCACAAGGGATCTAATAAG 
C C T T GCT G AAC AGT TTGAGCTTTATTC T AC AG ACT CT -95 

Exon IB (-381 to -95, ' GR3 . 3 ' ) 

AAATTTTAGTTGGAGAATGCACAGGAGAAGGTGTAGACATAGTTTTGTCTGTGTCAGACACCTTGATAGCAAACACTATAGGTAG 
AGGCTTCTGGAGGCAGGATTGCTCTTAGTCTAAGGTCTTATGGTAACATTTGCAGGGGATACGGGTTAGCATTACAAAATGGGGA 
TGCTTCAGGAAGGGCGTGGGCGAGTGCTTCTTGCTATTACTTCTTGTAACCAAAGCTAGTCTAGGCTTTTGGGTATGTTCAACAC 
TGCATGGCCTTTGGAGCACTTTTATGGGAAGA -95 

Exon 1C " (-415 to -95, 'GR3.4') 

AAAGACAGGAACGCACTGCCACATTGAGAGGATAGCATTTTAGTTGGAGAGTGTACAGGAGAAGGTGTAGACATAGTTTTGTCTG 
TGTCAGACACCTNGATAGCAAACACTATAGGTAGAGGCTTCTGGAGGCAGGATTGCTCTTAGTCTAAGGTCTTATGGTAACATTT 
GCAGGGGATACGGGTTAGCATTACAAAATGGGGATGCTTCAGGAAGGGCGTGGGCGAGTGCTTCTTGCTATTACTTCTTGTAACC 
AAAGCTAGTCTAGGCTTTTGGGTATGTTCAACACTGCATGGCCTTTGGAGCACTTTTATGGGAAGA - 95 

Exon ID (-186 to -95, 'GR8.2') 

AATTAGAAACAACGGGGTTCTGCGACAGAGTGGGCGCAGGGTCATCCACCGCAGGGAGAGTGTGTCCCTCGGGGACACGCGAGCG 
CTCCAGA -95 

Exon IE (-217 to -95, *GR8.3') 

AAGTTGAAGGGGTGCTGCTGCGGAGGAAGGGCATTAGAAACAACGGGGTTCTGCGACAGAGTGGGCGCAGGGTCATCCACCGCAG 
GGAGAGTGTGTCCCTCGGGGACACGCGAGCGCTCCAGA -95 

Exon IF (-166 to -95, 'GR8.4') 

AAACCAAAGCTAGTCT AGGCTTCTTGGGTATGTTCAACACTGCATGGCCTTTGGAGCACTTATATGGGAAGA - 9 5 
Exon 1G (-250 to -95, EST 850011) 

GTCCACCCGCCCGCCGAGGTCCGCCCGCCCGCCGAGACCCGCCCGCCGCCGCTGCCATGGCCGAAGTGCGCAAATTCACCAAACG 
GCTCAGCAAGCCTGGCACGGCGGCTGANATCCGGCAGACGTGTCTGAGGCCGTGCGGGGCTCCGTGGTGCT -95 

Exon 1H (-187 to -95, 'GR1988') 

AAATTAGAAACAACGGGGTTCTGCGACAGAGTGGGCGCAGGGTCATCCACCGCAGGGAGAGTGTGTCCCTCGGGGACACGCGAGC 
GCTCCAGA -95 



Exon II (-144 to -95, •GR2005') 

AACTCCCGGATGCGGCTGCACCAGCAACATCAGCTGCGGGGAGCCCGAGC -95 



Exon 1J (-131 to -95, 'GR1986') 
AACCCAGAAAGCCCCGGGCGCCCAAGCTCGGGCTGGC -95 



-94 



G 


GAA 


AAG 


GCC 


AAA 


GTT 


GTT 


GAG 


CCC 


CTG 


GAC 


TAT 


GAG 


AAT 


GTT 


ATT 


GCC 


CAA 


AGA 


AAA 


-37 


ACC 


CAG 


ATT 


TAC 


AGC 


GAC 


CCC 


CTC 


CGA 


GAT 


CTG 


CTT 


1 

ATG 


TTC 


CCA 


ATG 


GAA 


GAT 


ATA 


TCT 


24 


























met 


phe 


pro 


met 


glu 


asp 


ile 


ser 




ATC 


TCG 


GTG 


ATA 


GGT 


CGT 


CAG 


CGC 


AGA 


ACG 


GTG 


CAG 


TCT 


ACT 


GTA 


CCA 


GAA 


GAT 


GCT 


GAA 


84 


ile 


ser 


val 


ile 


gly 


arg 


gin 


arg 


arg 


thr 


val 


gin 


ser 


thr 


val 


pro glu 


asp 


ala 


glu 




AAG 


AGG 


GCC 


CAG 


AGT 


TTA 


TTT 


GTT 


AAA 


GAG 


TGT 


ATT 


AAA 


ACC 


TAT 


AGC 


ACA 


GAT 


TGG 


CAC 


144 


lys arg 


ala 


gin 


ser 


leu 


phe 


val 


lys 


glu 


cys 


ile 


lys 


thr 


tyr 


ser 


thr 


asp 


trp 


his- 





A 



1 of 7 



GTG 
val 


GTA 
val 


AAC 
asn 


TAC 
tyr 


AAG 
lys 


TAT 
tyr 


GAG 
glu 


GAC 
asp 


TTC 
phe 


TCT 
ser 


GGG 
gly 


GAC 
asp 


TTT 
phe 


CGA 
arg 


ATG 
met 


TTG 
leu 


CCA 
pro 


TGT 
cys 


AAA 
lys 


TCT 
ser 


204 


TTG 
leu 


AGA 
arg 


CCA 
pro 


GAA 
glu 


AAG 
lys 


ATT 
ile 


CCT 
pro 


AAT 

asn 


CAT 
his 


GTA 
val 


TTT 
phe 


GAG 
glu 


ATA 
ile 


GAT 
asp 


GAA 
glu 


GAC 
asp 


TGT 
cys 


GAG 
glu 


AAA 
lys 


GAT 
asp 


264 


GAG 
glu 


GAC 
asp 


TCA 

ser 


TCT 
ser 


TCT 
ser 


TTA 
leu 


TGT 
cys 


TCT 
ser 


CAG 
gin 


AAG 
lys 


GGT 
gly 


GGT 
gly 


GTG 
val 


ATA 
ile 


AAA 
lys 


CAA 
gin 


GGC 
gly 


TGG 
trp 


TTG 
leu 


CAT 
his 


324 


AAA 
lys 


GCA 
ala 


AAT 

asn 


GTA 
val 


AAT 
asn 


AGT 
ser 


ACC 
thr 


ATC 
ile 


ACA 
thr 


GTA 
val 


ACC 
thr 


ATG 
met 


AAG 
lys 


GTA 
val 


TTC 
phe 


AAG 
lys 


AGA 
arg 


CGA 
arg 


TAT 

tyr 


TTT 
phe 


384 


TAC 
tyr 


TTG 
leu 


ACC 
thr 


CAA 
gin 


CTT 
leu 


CCT 
pro 


GAC 
asp 


GGT 
gly 


TCA 
ser 


TAT 
tyr 


ATT 
ile 


CTC 
leu 


AAT 

asn 


TCC 
ser 


TAT 
tyr 


AAA 

lys 


GAT 
asp 


GAG 
glu 


AAA 
lys 


AAT 

asn 


444 


ser 


lyt 


glu 


ser 


AAA 

lys 


GGT 
gly 


TGC 

cys 


ATC 
ile 


TAC 
tyr 


TTG 
leu 


GAC 
asp 


GCC 
ala 


TGC 

cys 


ile 


asp 


val 


val 


CAG 
gin 


TGC 

cys 


CCC 
pro 


504 


AAA 
lys 


met 


CGC 

arg 


CGT 
arg 


CAT 
his 


GCT 

ala 


phe 


GAA 
glu 


CTC 
leu 


AAG 
lys 


ATG 

met 


TTA 
leu 


GAT 
asp 


lys 


tyr 


AGC 
ser 


CAT 

his 


TAT 

tyr 


CTG 
leu 


GCT 

ala 


564 


GCT 
ala 


GAA 

glu 


ACT 
thr 


GAG 
glu 


CAG 
gin 


GAA 
glu 


ATG 
met 


GAG 
glu 


GAA 
glu 


TGG 
trp 


TTG 
leu 


ATA 
ile 


ACT 
thr 


TTG 
leu 


AAA 
lys 


AAG 
lys 


ATT 
ile 


ATT 
ile 


CAG 
gin 


ATC 
ile 


624 


AAC 
asn 


ACC 
thr 


GAC 
asp 


AGT 
ser 


TTA 
leu 


val 


CAA 
gin 


GAA 
glu 


AAA 
lys 


AAG 
lys 


GAG 
glu 


ACG 
thr 


GTA 
val 


glu 


ACA 
thr 


GCA 
ala 


CAA 
gin 


GAT 
asp 


GAT 
asp 


GAA 
glu 


684 


ACT 
thr 


AGC 
ser 


AGC 
ser 


CAA 
gin 


GGA 
gly 


AAA 
lys 


GCC 
ala 


GAG 
glu 


AAC 
asn 


ATC 
ile 


ATG 
met 


GCA 
ala 


AGT 
ser 


TTG 
leu 


GAA 
glu 


AGG 
arg 


AGC 
ser 


ATG 
met 


CAT 
his 


CCG 
pro 


744 


GAA 
glu 


CTG 
leu 


ATG 
met 


AAG 
lys 


TAT 
tyr 


GGA 
gly 


AGA 
arg 


GAA 
glu 


ACT 
thr 


GAA 
glu 


CAA 
gin 


CTA 
leu 


AAC 
asn 


AAA 
lys 


CTC 
leu 


AGT 
ser 


AGA 
arg 


GGA 
gly 


GAT 
asp 


GGA 
gly 


804 


arg 


gin 


asn 


leu 


phe 


TCT 
ser 


phe 


GAT 
asp 


TCA 
ser 


GAA 
glu 


GTT 
val 


CAG 
gin 


arg 


leu 


asp 


phe 


ser 


GGA 
gly 


ATT 
ile 


GAA 
glu 


864 


CCT 
pro 


GAT 
asp 


ATA 
ile 


AAG 
lys 


CCA 
pro 


phe 


GAA 
glu 


GAA 
glu 


AAA 
lys 


TGC 

cys 


AAT 
asn 


AAA 
lys 


CGT 
arg 


TTC 
phe 


CTG 
leu 


GTG 
val 


AAT 
asn 


TGC 
cys 


CAT 
his 


GAT 
asp 


924 


TTA 
leu 


ACT 
thr 


TTC 
phe 


AAT 
asn 


ATC 
ile 


TTG 
leu 


GGC 
gly 


CAA 
gin 


ATT 
ile 


GGA 
gly 


GAC 
asp 


AAT 
asn 


GCA 
ala 


AAA 
lys 


GGA 
gly 


CCA 
pro 


CCC 
pro 


ACA 
thr 


AAT 

asn 


GTT 
val 


"984 


GAG CCC 
glu pro 


phe 


phe 


ATC 
ile 


AAT 
asn 


CTT 
leu 


GCC 
ala 


TTA 
leu 


phe 


GAT 
asp 


GTA 
val 


AAG 
lys 


AAC 
asn 


AAT 
asn 


TGT 
cys 


AAG 
lys 


ATT 
ile 


TCA 
ser 


GCA 
ala 


1044 


GAC TTT 
asp phe 


CAT 
his 


GTA 
val 


GAC 
asp 


CTG 
leu 


AAT 
asn 


CCC 
pro 


CCA 
pro 


TCT 
ser 


GTC 
val 


CGT 
arg 


GAA 
glu 


ATG 
met 


CTG 
leu 


TGG 
trp 


GGC 
gly 


TCT 
ser 


TCA 
ser 


ACC 
thr 


1104 


CAA 
gin 


CTG 
leu 


ala 


ser 


asp 


gly 


ser 


pro 


ly"s 


gly 


ser 


ser 


pro 


glu 


ser 


tyr 


ile 


his 


gly 


ile 




GCC 
ala 


GAA 
glu 


TCT 
ser 


CAG 
gin 


TTA 
leu 


CGC 
arg 


TAC 
tyr 


ATA 
ile 


CAA 
gin 


CAG 
gin 


GGA 
gly 


ATT 
ile 


TTC 
phe 


TCA 
ser 


GTG 
val 


ACG 
thr 


AAT 
asn 


CCA 
pro 


CAT 
his 


CCT 
pro 


1224 


GAA 
glu 


ATT 
ile 


TTT 
phe 


CTA 
leu 


GTT 
val 


GCC 
ala 


AGA 
arg 


ATT 
ile 


GAA 
glu 


AAG 
lys 


GTA 
val 


CTA 
leu 


CAG 
gin 


GGA 
gly 


AAC 
asn 


ATT 
ile 


ACA 
thr 


CAC 
his 


TGT 
cys 


GCA 
ala 


1284 



A 



2 of 7 



GAA 
glu 


CCC 
pro 


TAT 
tyr 


ATC 
ile 


AAA 

lys 


AAT 
asn 


TCT 
ser 


GAT 
asp 


CCA 
pro 


GTA 
val 


AAG 
lys 


ACG 
thr 


GCC 
ala 


CAG 
gin 


AAG 
lys 


GTG 
val 


CAC 
his 


AGG 
arg 


ACA 
thr 


GCT 
ala 


1344 


AAA CAA 
lys gin 


GTG 
val 


TGT 
cys 


AGC 
ser 


CGC 
arg 


CTT 
leu 


GGA 
gly 


CAA 
gin 


TAC 
tyr 


AGA 
arg 


ATG 
met 


CCC 
pro 


TTC 
phe 


GCT 
ala 


TGG 
trp 


GCT 
ala 


GCC 
ala 


AGA 
arg 


CCC 
pro 


1404 


ATT 
ile 


TTC 
phe 


AAA 
lys 


GAT 
asp 


ACT 
thr 


CAA 
gin 


GGC 
gly 


TCT 
ser 


CTT 

leu 


GAT 
asp 


CTG 
leu 


GAT 
asp 


GGG 
gly 


AGA 
arg 


TTT 
phe 


TCT 
ser 


CCT 
pro 


CTG 
leu 


TAT. 
tyr 


AAA 
lys 


1464 


CAA GAC 
gin asp 


AGT 
ser 


AGC 
ser 


AAG 
lys 


CTT 
leu 


TCA 

ser 


AGT 
ser 


GAA 
glu 


GAC 
asp 


ATT 
ile 


CTC 
leu 


AAG 
lys 


TTG 
leu 


CTC 
leu 


TCA 
ser 


GAA 
glu 


TAT 
tyr 


AAG 
lys 


AAG 
lys 


1524 


CCA 
pro 


GAA 
glu 


AAG 
lys 


ACC 
thr 


AAA 
lys 


CTG 
leu 


CAG 
gin 


ATT 
ile 


ATT 
ile 


CCT 
pro 


GGG 
gly 


CAG 
gin 


CTA 
leu 


AAC 
asn 


ATC 
ile 


ACA 
thr 


GTA 
val 


GAA 
glu 


TGT 
cys 


GTT 
val 


1584 


CCT 
pro 


GTG 
val 


GAT 
asp 


TTA 
leu 


TCA 
ser 


AAT 
asn 


TGT 
cys 


ATT 
ile 


ACT 
thr 


TCT 
ser 


TCA 
ser 


TAT 
tyr 


GTG 
val 


CCC 
pro 


TTG 
leu 


AAG 
lys 


CCT 
pro 


TTT 
phe 


GAA 
glu 


AAG 
lys 


1644 


AAT 
asn 


TGC 
cys 


CAA 
gin 


AAT 
asn 


ATT 
ile 


ACT 
thr 


GTG 
val 


GAG 
glu 


GTT 
val 


GAA 
glu 


GAG 
glu 


TTT 
phe 


GTT 
val 


CCA 
pro 


GAA 
glu 


ATG 
met 


ACA 
thr 


AAA 
lys 


TAT 
tyr 


TGT 
cys 


1704 


TAT 
tyr 


CCA 
pro 


TTT 
phe 


ACT 
thr 


ATT 
ile 


TAC 
tyr 


AAA 
lys 


AAC 
asn 


CAT 
his 


CTG 
leu 


TAT 
tyr 


GTA 
val 


TAT 
tyr 


CCC 
pro 


CTG 
leu 


CAA 
gin 


TTA 
leu 


AAA 
lys 


TAC 
tyr 


GAT 
asp 


1764 


AGC 
ser 


CAG 
gin 


AAA 
lys 


ACA 
thr 


TTT 
phe 


GCC 
ala 


AAG 
lys 


GCA 
ala 


AGG 
arg 


AAC 
asn 


ATT 
ile 


GCA 
ala 


GTC 
val 


TGT 
cys 


GTG 
val 


GAA 
glu 


TTC 
phe 


CGG 
arg 


GAT 
asp 


TCA 
ser 


1824 


GAT 
asp 


GAA 
glu 


AGT 
ser 


GAC 
asp 


GCT 
ala 


AGT 
ser 


GCC 
ala 


CTA 
leu 


AAG 
lys 


TGT 
cys 


ATT 
ile 


TAT 
tyr 


GGA 
gly 


AAA 
lys 


CCT 
pro 


GCA 
ala 


GGG 
gly 


TCT 
ser 


GTT 
val 


TTT 
phe 


1884 


ACC 
thr 


ACA 
thr 


AAT 
asn 


GCT 
ala 


TAT 
tyr 


GCT 
ala 


GTT 
val 


GTC 
val 


TCG 
ser 


CAT 
his 


CAC 
his 


AAC 
asn 


CAA 
gin 


AAT 
asn 


CCA 
pro 


GAG 
glu 


TTC 
phe 


TAT 
tyr 


GAT 
asp 


GAG 
glu 


1944 


ATT 
ile 


AAA 
lys 


ATT 
ile 


GAG 
glu 


CTT 
leu 


CCC 
pro 


ATT 
ile 


CAC 
his 


CTA 
leu 


CAT 
his 


CAA 
gin 


AAA 
lys 


CAT 
his 


CAT 
his 


TTG 
leu 


CTT 
leu 


TTC 
phe 


ACT 
thr 


TTT 
phe 


TAT 
tyr 


2004 


CAT 
his 


GTA 
val 


AGT 
ser 


TGT 
cys 


GAA 
glu 


ATT 
ile 


AAC 
asn 


ACA 
thr 


AAG 
lys 


GGA 
gly 


ACA 
thr 


ACC 
thr 


AAA 
lys 


AAG 
lys 


CAA 
gin 


GAC 
asp 


ACA 
thr 


GTT 
val 


GAA 
glu 


ACT 
thr 


2064 


CCA 
pro 


GTT 
val 


GGG 
gly 


TTT 
phe 


GCC 
ala 


TGG 
trp 


GTA 
val 


CCT 
pro 


TTG 
leu 


CTG 
leu 


AAA 
lys 


GAT 
asp 


GGT 
gly 


AGA 
arg 


ATC 
ile 


ATC 
ile 


ACA 
thr 


TTT 
phe 


GAG 
glu 


CAG 
gin 


2124 


CAG CTG 
gin leu 


CCA 
pro 


GTT 
val 


TCC 
ser 


GCC 
ala 


AAT 
asn 


CTT 
leu 


CCC 
pro 


CCA 
pro 


GGC 
gly 


TAC 
tyr 


TTG 
leu 


AAT 
asn 


CTG 
leu 


AAT 
asn 


GAT 
asp 


GCA 
ala 


GAA 
glu 


TCA 
ser 


2184 


AGA 
arg 


AGG 
arg 


CAA 
gin 


TGT 
cys 


AAC 
asn 


GTG 
val 


GAT 
asp 


ATT 
ile 


AAA 
lys 


TGG 
trp 


GTA 
val 


GAT 
asp 


GGT 
gly 


GCA 
ala 


AAG 
lys 


CCT 
pro 


TTG 
leu 


TTG 
leu 


AAG 
lys 


TTT 
phe 


2244 


AAA 
lys 


AGC 
ser 


CAC 
his 


TTA 
leu 


GAA 
glu 


TCT 
ser 


ACC 
thr 


ATT 
ile 


TAC 
tyr 


ACT 
thr 


CAA 
gin 


GAT 
asp 


CTG 
leu 


CAT 
his 


GTG 
val 


CAC 
his 


AAA 
lys 


TTC 
phe 


TTC 
phe 


CAT 
his 


2304 


CAT 
his 


TGC 
cys 


CAG 
gin 


CTG 
leu 


ATT 
ile 


CAG 
gin 


TCA 
ser 


GGC 
gly 


TCG 
ser 


AAA 

lys 


GAA 
glu 


GTT 
val 


CCA 
pro 


GGG 
gly 


GAG 
glu 


CTC 
leu 


ATT 
ile 


AAA 
lys 


TAT 
tyr 


TTA 
leu 


2364 


AAG TGT 
lys cys 


TTG 
leu 


CAT 
his 


GCC 
ala 


ATG 
met 


GAG 
glu 


ATC 
ile 


CAA 
gin 


GTC 
val 


ATG 
met 


ATA 
ile 


CAG 
gin 


TTT 
phe 


CTA 
leu 


CCT 
pro 


GTA 
val 


ATT 
ile 


CTT 
leu 


ATG 
met 


2424 


CAA CTC 
gin leu 


TTC 
phe 


CGA 
arg 


GTT 
val 


CTC 
leu 


ACA 
thr 


AAT 
asn 


ATG 
met 


ACC 
thr 


CAT 
his 


GAA 
glu 


GAT 

asp 


GAC 
asp 


GTT 
val 


CCT 
pro 


ATC 
ile 


AAC 
asn 


TGC 
cys 


ACC 
thr 


2484 



A 



3 of 7 



ATG 
met 


GTT 
val 


CTC 
leu 


TTA 
leu 


CAT 
his 


ATT 
ile 


GTA 
val 


TCA 
ser 


AAG 
lys 


TGC 
cys 


CAT 
his 


GAA 
glu 


GAA 
glu 


GGC 
gly 


TTG 
leu 


GAT 
asp 


AGT 
ser 


TAT 
tyr 


CTA 
leu 


AGA 
arg 


2544 


TCA 
ser 


TTC 
phe 


ATA 
ile 


AAG 
lys 


TAT 
tyr 


AGC 
ser 


TTC 
phe 


CGA 
arg 


CCT 
pro 


GAA 
glu 


AAA 
lys 


CCG 
pro 


AGT 
s r 


GCT 
ala 


CCT 
pro 


CAG 
gin 


GCC 
ala 


CAG 
gin 


CTG 
leu 


ATA 
ile 


2604 


CAT 
his 


GAA 
glu 


ACC 
thr 


CTG 
leu 


GCT 
ala 


ACT 
thr 


ACG 
thr 


ATG 
met 


ATA 
ile 


GCA 
ala 


ATA 
ile 


TTG 
leu 


AAA 
lys 


CAG 
gin 


TCT 
ser 


GCA 
ala 


GAT 
asp 


TTT 
phe 


TTA 
leu 


TCA 
ser 


2664 


ATA 
ile 


AAC 
asn 


AAA 
lys 


TTG 
leu 


CTA 
leu 


AAG 
lys 


TAC 
tyr 


TCA 
ser 


TGG 
trp 


TTT 
phe 


TTC 
phe 


TTT 
phe 


GAA 
glu 


ATA 
ile 


ATT 
ile 


GCA 
ala 


AAG 
lys 


TCA 
ser 


ATG 
met 


GCC 
ala 


2724 


ACA 
thr 


TAC 
tyr 


TTG 
leu 


TTG 
leu 


GAA 
glu 


GAG 
glu 


AAT 
asn 


AAG 
lys 


ATT 
ile 


AAG 
lys 


CTT 
leu 


CCC 
pro 


CGA 
arg 


GGC 
gly 


CAG 
gin 


AGA 
arg 


TTT 
phe 


CCC 
pro 


GAG 
glu 


ACA 
thr 


2784 


TAT 
tyr 


CAT 
his 


CAT 
his 


GTC 
val 


TTA 
leu 


CAT 
his 


TCA 
ser 


CTG 

leu 


CTT 
leu 


CTT 
leu 


GCA 
ala 


ATA 

ile 


ATT 

ile 


CCC 
pro 


CAT 
his 


GTG 
val 


ACT 
thr 


ATT 

ile 


CGG 
arg 


TAT 
tyr 


2844 


GCG 
ala . 


GAG 
glu 


ATT 
ile 


CCC 
pro 


GAT 
asp 


GAG 
glu 


TCC 
ser 


AGA 
arg 


AAT 
asn 


GTG 
val 


AAC 
asn 


TAT 
tyr 


AGT 

ser 


TTG 
leu 


GCT 
ala 


AGC 
ser 


TTC 
phe 


CTG 
leu 


AAG 
lys 


CGC 
arg 


2904 


TGT 
cys 


TTG 
leu 


ACA 
thr 


CTA 
leu 


ATG 
met 


GAT 
asp 


AGA 
arg 


GGA 
gly 


TTT 
phe 


ATT 
ile 


TTC 
phe 


AAT 
asn 


TTA 
leu 


ATA 
ile 


AAT 

asn 


GAC 
asp 


TAT 

tyr 


ATA 
ile 


TCT 

ser 


GGA 
gly 


2964 


TTC 
phe 


AGC 
ser 


CCC 
pro 


AAA 
lys 


GAT 
asp 


CCT 
pro 


AAG 
lys 


GTT 
val 


CTG 
leu 


GCT 
ala 


GAA 
glu 


TAC 
tyr 


AAG 
lys 


TTT 
phe 


GAA 
glu 


TTT 
phe 


CTG 
leu 


CAA 
gin 


ACA 
thr 


ATT 
ile 


3024 


TGC 
cys 


AAT 
asn 


CAC 
his 


GAA 
glu 


CAT 

his 


TAC 
tyr 


ATT 
ile 


CCT 
pro 


CTG 
leu 


AAC 
asn 


TTG 
leu 


CCA 
pro 


ATG 
met 


GCA 
ala 


TTT 
phe 


GCA 
ala 


AAA 

lys 


CCT 
pro 


AAA 
lys 


CTG 
leu 


3084 


CAG 
gin 


CGG 
arg 


GTT 
val 


CAA 
gin 


GAT 
asp 


TCA 
ser. 


AAT 
asn 


CTT 

leu 


GAA 
glu 


TAC 
tyr 


AGT 
ser 


TTA 

leu 


TCA 

ser 


GAT 
asp 


GAG 
glu 


TAT 
tyr 


TGC 
cys 


AAG 
lys 


CAT 

his 


CAC 
his 


3144 


TTC 
phe 


TTG 
leu 


GTT 
val 


GGT 
gly 


CTA 
leu 


CTT 
leu 


CTG 
leu 


AGG 
arg 


GAA 
glu 


ACT 
thr 


TCC 
ser 


ATT 
ile 


GCT 
ala 


CTT 

leu 


CAG 
gin 


GAC 
asp 


AAT 
asn 


TAT 
tyr 


GAG 
glu 


ATC 
ile 


3204 


AGA 
arg 


TAT 
tyr 


ACA 
thr 


GCT 
ala 


ATC 
ile 


TCT 

ser 


GTT 
val 


ATA 
ile 


AAG 
lys 


AAT 

asn 


CTT 
leu 


TTG 
leu 


ATA 

ile 


AAA 

lys 


CAT 

his 


GCA 
ala 


TTT 
phe 


GAC 

asp 


ACA 
thr 


AGA 
arg 


3264 


TAC CAG CAC 
tyr gin his 


AAG 
lys 


AAC 
asn 


CAA 
gin 


CAA 
gin 


GCC 
ala 


AAA 
lys 


ATA 
ile 


GCA 

ala 


CAA 
gin 


TTG 
leu 


TAC 
tyr 


CTC 
leu 


CCC 
pro 


TTT 
phe 


GTT 
val 


GGA 
gly 


CTA 
leu 


3.324 


CTT 

leu 


TTG 
leu 


GAA 
glu 


AAT 

asn 


ATA 
ile 


CAG 
gin 


CGA 
arg 


TTA 
leu 


GCA 
ala 


GGT 

giy 


CGA 
arg 


GAT 
asp 


ACC 
thr 


TTG 
leu 


TAT 
tyr 


TCT 

ser 


TGT 
cys 


GCA 
ala 


GCC 
ala 


ATG 
met 


3384 


CCT 
pro 


AAT 
asn 


TCT 

ser 


GCA 
ala 


TCC 
ser 


AGA 
arg 


GAT 
asp 


GAG 
glu 


TTT 

phe 


CCA 
pro 


TGT 

cys 


GGC 
gly 


TTT 
phe 


ACT 
thr 


TCA 
ser 


CCT 
pro 


GCC 
ala 


AAT 

asn 


AGA 
arg 


GGG 
gly 


3444 


AGT 
ser 


CTG 
leu 


AGC 
ser 


ACT 
thr 


GAC 
asp 


AAA 
lys 


GAC 
asp 


ACC 
thr 


GCT 
ala 


TAT 
tyr 


GGG 
gly 


TCT 
ser 


TTT 
phe 


CAA 
gin 


AAT 
asn 


GGA 
gly 


CAT 
his 


GGA 
gly 


ATT 
ile 


AAG 
lys 


3504 


AGA GAA GAT 
arg glu asp 


TCA 
ser 


AGA 
arg 


GGT 
gly 


TCC 
ser 


CTC 
leu 


ATC 
ile 


CCA 
pro 


GAA 
glu 


GGA 
gly 


GCA 
ala 


ACA 
thr 


GGA 
gly 


TTT 
phe 


CCA 
pro 


GAT 
asp 


CAG 
gin 


GGC 
gly 


3564 


AAC 
asn 


ACT GGT 
thr gly 


GAA 
glu 


AAT 
asn 


ACC 
thr 


CGA 
arg 


CAG 
gin 


AGT 
ser 


TCT 
ser 


ACA 
thr 


AGG 
arg 


AGT 
ser 


AGT 
ser 


GTA 
val 


TCC 
ser 


CAG 
gin 


TAT 
tyr 


AAC 
asn 


CGC 
arg 


3624 



FIG.^ 
4 of 7 



CTG 
leu 


GAT 


CAG 


TAT 
tyr 


GAA 


ATC 

ile 


AGA 
arg 


TCA 


GAA 


GAT 


ACT 
thr 


CTC 
leu 


TTA 
leu 


ACT 
thr 


CTT 
leu 


ATA 
ile 


CTT 
leu 


TTA 
leu 


GAA 
glu 


GTA 
val 


TGC 
cys 


AGG 


GTG. 
val 


CAT 
his 


GAT 
asp 


GCC 
ala 


TGG 
trp 


CTG 
leu 


CCT 


GCT 
ala 


CTT 
leu 


CGA 


AAC 


AGA 
arg 


TCA 
ser 


GAA 
glu 


AGT 


TCA 


TTT 


ACA 
thr 


CTT 


AAT 

asn 


GCA 


CTT 
leu 


CTT 


GAA 


GGC 


AAT 


ACA 
thr 


TTT 


TTC 


ACT 
thr 


CAG 
gin 


TGC 


TTC 


AAG 
lys 


AAA 
lys 


AAA 
lys 


GTG 
val 


TTT 


GAT 


ATA 
ile 


CAT 
his 


AAA 
lys 


CAT 
his 


GTA 
val 


TTT 


GCC 


TCA 


CTG 
leu 


GGA 


AGA 


GTA 
val 


AAC 


ATG 


TGT 


GCT 
ala 


ATT 
ile 


AGC 


TCA 


ACC 
thr 


AGG 


AAT 


GAA 


GAG 


TAT 


ACC 
thr 


AAA 
lys 


AGG 


AAA 

lys 


ACC 
thr 


CAA 


CTG 
leu 


ATA 
ile 


GCT 


GAT 


GTA 
val 


GCA 


ATC 
ile 


AAT 


AAT 


TTT 


GCA 
ala 


AAT 
asn 


AGT 
ser 


AAA 
lys 


GAC 
asp 


TTG 
leu 


ACC 
thr 


AAG 
lys 


AGA 
arg 


ATC 
ile 


GAG 
glu 


AAA 
lys 


GAC 
asp 


CCT 
pro 


GAA 
glu 


ATG 
met 


CTA 
leu 


ACC CCA GAG CTC 
thr pro glu leu 


AGG 
arg 


AAA 
lys 


ACC 


GAT TTT 
asp phe 


TCA 
ser 


GAG 
glu 


GCT 
ala 


GCG 
ala 


ATG 
met 


CAT 
his 


CGA AAA 
arg lys 


AAA 
lys 


TTA 
leu 


TTT 
phe 


CCT 
pro 



AGC 
ser 


CTC 
leu 


CTG 
leu 


ATG 
met 


TGC 
cys 


TAC 
tyr 


CTG 
leu 


TAC 
tyr 


TGG 
trp 


AAT 
asn 


AAA 
lys 


GTA 
val 


TCA 
ser 


CCT 
pro 


TTG 
leu 


TTT 
phe 


CAC 
his 


TTT 
phe 


AGA 
arg 


TAT 
tyr 


ATG 
met 


TCA 
ser 


AAA 

lys' 


CAC 
his 


TTC 
phe 


GGA 
gly 


ATA 
ile 


GAC 
asp 


GGA 


GTA 
val 


ATG 
met 


CAG 


GCC 
ala 


CGG 


CTT 
leu 


CAC 

his 


AGT 


TCT 


ACA 
thr 


ACA 
thr 


ACT 
thr 


GAA 
glu 


GCT 


ACT 
thr 


GAA 


GTT 
val 


TCC 


CTA 

leu 


ACA 
thr 


ACC 
thr 


CAA 


CTT 


TTA 
leu 


AAT 


AAT 


GAT 


CTT 


GCT 


TTT 


CTT 


AAA 
lys 


AAT 


GGA 


AGA 


GCT 


TTC 


ATC 
ile 


AGT 


AAG 
lys 


TTT 


GCA 


TTT 


TGC 


TAT 
tyr 


GAG 


GTT 
val 


TTA 
leu 


GCA 

ala 


TCT 
ser 


GCA 
ala 


CTT 
leu 


TTG 


TAT 
tyr 


CTT 
leu 


TTT 


TTG 
leu 


AGG 


ACA 
thr 


CAT 
his 


CTA 
leu 


CAG 


CTA 
leu 


AGC 


GGA 


GGA 


TCA 


AGA 


TTT 


GAC 
asp 


AGA 
arg 


CCT 


ATG 
met 


AAG 
lys 


GCA 


ACT 
thr 


CGC 
arg 


ACT 
thr 


GTT 
val 


CTT 
leu 


ATG 
met 


GCC 
ala 


ACT 
thr 


ATT 
ile 


GAT 
asp 


CTC 
leu 


CAG 
gin 


TAT 
tyr 


AGC 
ser 


TTA 
leu 


TGG 
trp 


CTT 


GAT 
asp 


AGC 


ATG 


GCC 


AAG 
lys 


TGT 
cys 


TAT 
tyr 


GTC 
val 


CAT 

his 


GTA 
val 


GCA 
ala 


GCT 
ala 


AAC 
asn 


GGA 
gly 


TGT 
cys 


TCA 
ser 


GCG 
ala 


TTC 
phe 


AAG 
lys 



TAT ATA GTA AAA ATG ATT 3684 
tyr ile val lys met ile 

CAG GAG CTC ATA AAC ATT 37 4 4 
gin glu leu ile asn ile 

GGG AAA AGA AAC ATA GCA 3804 
gly lys arg asn ile ala 

CGA AAA TCG CAA ACC ATG 3864 
arg lys ser gin thr met 

CAG CAT CTT AGT AGC CTA 3924 
gin his leu ser ser leu 

GCA GAC ATT TTC CAC CAG 398 4 
ala asp ile phe his gin 

GTA CTA GAC ACC ATA TCA 4 04 4 
val leu asp thr ile ser 

GGC CAT AAC CCA TTA ATG 4104 
gly his asn pro leu met 

CAA TCT GAA GTG TCG CTG 4164 
gin ser glu val ser leu 

CCT TCA GCA TTT TTC AAA 4 224 
pro ser ala phe phe lys 

AAG TGC TGC ACA TCG AAG 4 284 
lys cys cys thr ser lys 

TTG ATG AGA AAC AAC TTT 4 34 4 
leu met arg asn asn phe 

ATA ATA ATT GCT GTA AGC 44 04 
ile ile ile ala val ser 

CAG GAG TCT TTA TTC ATT 4 4 64 
gin glu ser leu phe ile - 

GCC TTT CCC GCA GAA GTC 4 524 
ala phe pro ala glu val 

GCC CAA ATG AAG GAG CAT 4584 
ala gin met lys glu his 

GCC AAG TCC TAT GCA AGC 4 64 4 
ala lys ser tyr ala ser 

ATT CAT GTA AAA AAT GGA 4 704 
ile his val lys asn gly 

CTA GTT GCA GAG TTT CTT 4764 
leu val ala glu phe leu 

AAA ATT ACT CCC AAT ATA- 4 82 4 
lys ile thr pro asn ile 



Soil 



GAT GAA GAA GGA GCA ATG AAA 
asp glu glu gly ala met lys 

GTC TTG CTG GAG TTG CTA GAA 
val leu leu glu leu leu glu 

ATA ATT TCT GAG ATT TCC AAG 
ile ile ser glu ile ser lys 

AAA CTT ACT CAA GTT TAT AGA 
lys leu thr gin val tyr arg 

CAT ACA AAA AAG AGA CTT TTA 
his thr lys lys arg leu leu 

TTT GAA GAA GAA GAT GGA AAG 
phe glu glu glu asp gly lys 

GAA ATT TCC TTG AGA CTT GTT 
glu ile ser leu arg leu val 

ATA ATT CAG GAT TCA GAC AAG 
ile ile gin asp ser asp lys 

CAA GTT ACT TAT GTG AAG CCT 
gin val thr tyr val lys pro 

TTT GAA AGA AAT CAT AAT ATC 
phe glu arg asn his asn ile 

AAA AAA CAG GGC TGT ATA GAA 
lys lys gin gly cys ile glu 

TCG TTT CCT TAC GTG AAG AAG 
ser phe pro tyr val lys lys 

CCA ATT GAT GGT GCC ACT GAT 
pro ile asp gly ala thr asp 

TCC TCT ACT GAC GTG GAC ATG 
ser ser thr asp val asp met 

CAG GTC AAT GCT GGT CCA TTA 
gin val asn ala gly pro leu 

AAG TAT CCA CCT AAG AAA GTG 
lys tyr pro pro lys lys val 

TGC AGC ATT GCA CTT GAA CTA 
cys ser ile ala leu glu leu 

GAA GGG CTA AAG TCA AAT TTC 
glu gly leu lys ser asn phe 

CAG ATA TTA CAA GAA GAC ACA 
gin ile leu gin glu asp thr 



GAA GAT GCT GGG ATG ATG GAT 
glu asp ala gly met met asp 

CAA TGT GTG GAT GGC TTA TGG 
gin cys val asp gly leu trp 

TTG ATC GTT CCA ATT TAT GAG 
leu ile val pro ile tyr glu 

ACT CTT CAT GGA GCT TAC ACA 
thr leu his gly ala tyr thr 

GGC ACT TTC TTC AGA GTT GCC 
gly thr phe phe arg val ala 

GAG TAC ATC TAT AAA GAA CCA 
glu tyr ile tyr lys glu pro 

AAA CTT TAT GGT GAA AAG TTT 
lys leu tyr gly glu lys phe 

GTA AAT GCC AAA GAG CTT GAT 
val asn ala lys glu leu asp 

TAC TTT GAT GAC AAA GAA CTC 
tyr phe asp asp lys glu leu 

AGC AGA TTT GTT TTT GAG GCC 
ser arg phe val phe glu ala 

GAA CAG TGC AAA CGC CGT ACA 
glu gin cys lys arg arg thr 

AGG ATT CCT ATT AAC TGT GAA 
arg ile pro ile asn cys glu 

GAA ATA AAA GAT AAA ACT GCA 
glu ile lys asp lys thr ala 

ATT CAG CTC CAA CTT AAA TTG 
ile gin leu gin leu lys leu 

GCA TAT GCA AGA GCT TTC TTA 
ala tyr ala arg ala phe leu 

AGT GAG TTG AAA GAC ATG TTT 
ser glu leu lys asp met phe 

AAT GAG CGG CTA ATT AAA GAA 
asn glu arg leu ile lys glu 

AGA GAC ATG GTA AAA GAA TTA 
arg asp met val lys glu leu 

ATG CAT TCT CCC TGG ATG AGC 
met his ser pro trp met ser 



GTC CAT TAT AGT GAA GAG 4 884 
val his tyr ser glu glu 

AAG GCA GAA CGT TAT GAA 4 94 4 
lys ala glu arg tyr glu 

AAA CGT CGT GAG TTT GAG 5004 
lys arg arg glu phe glu 

AAA ATT CTG GAA GTT ATG 5064 
lys ile leu glu val met 

TTT TAT GGC CAA TCT TTT 5124 
phe tyr gly gin ser phe 

AAG CTC ACT GGC CTC TCA 518 4 
lys leu thr gly leu ser 

GGT ACG GAG AAT GTC AAA 524 4 
gly thr glu asn val lys 

CCA AAA TAT GCT CAT ATA 5304 
pro lys tyr ala his ile 

ACA GAA AGG AAG ACC GAG 5364 
thr glu arg lys thr glu 

CCT TAC ACT TTA TCA GGC 54 24 
pro tyr thr leu ser gly 

ATC TTG ACA ACT TCA AAC 54 84 
ile leu thr thr ser asn 

CAG CAG ATT AAT TTA AAA 554 4 
gin gin ile asn leu lys 

GAG CTG CAA AAG CTT TGC 5604 
glu leu gin lys leu cys 

CAG GGC TGG GTT TCT GTG "5664 
gin gly trp val ser val 

AAT GAC AGC CAA GCT AGC 57 2 4 
asn asp ser gin ala ser 

AGG AAA TTT ATA CAA GCA 57 84 
arg lys phe ile gin ala 

GAT CAA GTT GAG TAC CAT 584 4 
asp gin val glu tyr his 

TCT GAC ATT ATC CAT GAG 5904 
ser asp ile ile his glu 

AAC ACA TTA CAT GTA TTT 5964 
asn thr leu his val phe 
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TGT GCA ATT AGT GGT ACA TCA 

cys ala ile ser gly thr ser 

TGA GGA AAT GCA GAT GTA CGG 
OPA 

GTG CAA ATG TTA AAA TTT AAA 

TTT TCA TGT GTT CCA ACA GGG 

GAA AAT TGC ACC ACT GTG CTT 

AGA GCT CAA AAA CAG TAG TTC 

AAA TTG TAT AAT GGA ATA CAA 



AGT GAC CGA GGT TAT GGT TCC 
ser asp arg gly tyr gly ser 

TGA CAA TGA GAC TGA CCT TTC 

GAT TTG ATA TAC ATG GAG TGT 
TGC TTA CAT ATT TGT AAA TAA 
GGT TTG TAC TTT TTT AGG TAA 
AAT TTG CTT AAT TAT TGC TTA 
TAA AAG GTA AAA ATT 



CCA AGA TAC GCT GAA GTG 6024 
pro arg tyr ala glu val 

TCA GGA ATA TTT GGA 'GCT 6084 

TTC TTC TCd§ACA CCA AAA 614 4 

GCA ACT TGA AAG TGC CTG 6204 

ATC TAT ATG CTG AAA AGT 6264 

AAA TAA TGG TAC TAT GTA 6324 
6360 



